(NASA-CR- 135498) DETECTION AND ESTIMATION N73-32335^ 

OP AN OPTICAL IMAGE BY PHOTON-COUNTING 
TECHNIQUES Ph.D. Thesis (California 

Univ.) 218 p HC $13.00 CSCL 14B Unclas 

G3/14 15021 ) 


UNIVERSITY OF CALIFORNIA 
SAN DIEGO 

NGL 05 -0Q9 -079 1 

Detection and Estimation of an Optical 
Image by Photon-Counting Techniques 


A dissertation submitted in partial satisfaction of the requirements 
for the degree Doctor of Philosophy in 
Information and Computer Science 


by 


Lily Lee Wang 


Committee in charge: 

./ 

Professor Carl W, Helstrom, Chairman 
Professor Adolf W. Lohmann 
Professor Elias Masry 
Professor Michael J, Sharpe 
Professor Harold W. Sorenson 


1973 



The dissertation of Lily Lee Wang is approved, 
and it is acceptable in quality and form for 
publication on microfilm: 




University of California, San Diego 


1973 


Table of Contents 


Page. 

List of Figures . vi 

List of Tables . . . . , x 

Acknowledgements . .... . . ... xi 

Vita. - xii 

Abstract . . . . . . . • . . . . xiii 

Introduction 1 

Chapter I. Image formation and photon-counting 

statistics 

Image formation when diffraction theory is 

applied . 9 

Intensity representation of an incoherent 

object • « * • 14 

The statistical description of a photoelectric 
detector. . 16 

Distribution of the photoelectrons from a single 

small area when the time-bandwidth product 

IW » 1 18 

Distribution of the photoelectrons from two 
neighborhood areas when TW >> 1 . . . . . 26 

Distribution of the sum of the photoelectrons from 
a large number of small areas when TW >> 1 . 30 

Chapter II. Binary detection of the object light from 

a photosensitive surface. . . . . 37 

Binary hypothesis test ....... 38. 

The ideal photoelectric detector 42 

Threshold detector 50 

Simple detector 53 

Reliability and error probability of the ideal 
detector in the absence of the background light . 


iii 


55 



Page 

Reliability and error probability of the ideal 
detector in the presence of the background light , 57 

Performance of the ideal detector for a Gaussian 
image ... .........65 

Performance of the threshold detector for a Gaussian 
image in the presence of background light . . .82 

Performance of the simple detector for a Gaussian 
image in the presence of background light ... 88 

q o 

Discussion • 


Chapter III. Simultaneous detection and estimation 

of the mean intensity of an optical image 96 

Simultaneous detection and estimation for a single 
parameter 97 

Bayes strategy for detecting the image and estimating 
its intensity with a quadratic cost function . . 105 

Maximum-likelihood strategy and the almost optimum 
intensity estimate 110 

Performance of the almost optimum intensity estimate 
for a truncated Gaussian or a parabolic image . .113 

Discussion. 134 


Chapter IV. Detectors for resolving two point sources 
in optical communication by photo-counting 
techniques 136 

Ideal detector for resolving two light sources. . 137 

Ideal detector for binary detection in optical 
communication 141 

Simple detector for detection of binary bits . . 149 

Counting comparator for detection of binary bits . 163 

Image location effects on the detectors in the 
absence of background light ...... 169 


iv 



Page 


Discussion, * . * 177 


List of References, 


181 


Appendix A. 

Asymptotic expansion of the tail distribution 
by the method of steepest descent. 

184 

Appendix B. 

Iteration procedure for finding the saddle point 
for the method of steepest descent 

198 


v 



List of Figures 


Figure 1 
Figure 2 


Figure 3 
Figure 4 


Figure 5 


Figure 6 


Figure 7 


Figure 8 


Figure 9 


Figure 10 


Page 

Image formation system * 9 

Block diagram of hypothesis test for binary 
detection. 41 

Photoelectrical emission surface . , . . 42 

Detection probability Q d as a function of N g 
defined in (2.38) for the ideal detector in 
the absence of background light; false-alarm 
probability Qq “ 0.01, 0.1, 0.2 .... 69 

Probability Qq and probability Q d as a function 
of decision level g Q for the ideal detector cal- 
culated by the saddle point, Gaussian , and gamma 
approximations ;N q - N g = 5, where Nq is defined 
in (2.57) 73 

Probability Q d as a function of the signal-to- 
noise ratio defined in (2.58) for the ideal 

detector; Nq = 0.5, 5, Qq = 10 10 ^ . . 75 

Probability Q . as a function of N for ideal, 

ci ® „3 

threshold and simple detectors; Qq = 10 

at Nq = 0, 0.5, 5. . . . . . . • 77 

Probability Q . as a function of N for ideal, 
d 3 _ 

threshold and simple detectors, Qq = 10 at 

Nq = 0, 0.5, 5 79 

Average error probability P g in detecting the 

Gaussian image against the uniform background 

light vs. the mean number N for ideal and 

s 

simple detectors; Nq = 0,0.01, 0.5, 5 . 81 

Probability Q q and probability Q d vs. decision 
level gQ for threshold detector; N Q = 0.5, 


vx 



Page 


D I 2 = 0.1, 1, 2, 3, 4, 6 . . . ... . 87 

Figure 11 Block diagram for simultaneous detection and 
estimation of a single parameter 6 from the 
signal * • 104 

Figure 12 Image surface divided into rings. . . . 115 

vs 

Figure 13 Mean intensity estimate I vs. relative inten- 
sity Ig for truncated Gaussian image; Qq = 10 , 

total number of rings is 100. Mean' background counts 

u = 0.1 at R = o = 1 for time T . Curves are 
K A 

indexed by parameter defined in (3.44) . 120 

a 

Figure 14 Mean estimate I ga vs. relative intensity I g for 
truncated Gaussian image; Qq = 10 , y = 0.1 

at R = 1 for 100 rings for time T. Curves are 
indexed by the parameter M defined in (3.41). 122 

A 

Figure 15 Mean estimate I ga vs. relative intensity I g for 

truncated Gaussian image; Q q = 10 , M = 100, 

p = 0,1 at R = 1 for time T. Curves are indexed 
A 

by observation intervals T 12 4 

A 

Figure 16 Mean estimate I ga vs. relative intensity I g for 
parabolic image; Qq = 10 2 , y = 0.1 at R Q = 1 
for time T. Curves are indexed by parameter 
M 128 

A 

Figure 17 Mean estimate I vs. relative intensity I for 
parabolic image; Qq = 10 , y = 0.1 at R c = 1 

for time T. Curves are indexed by observation 
intervals T 130 

Figure 18 Error vs. mean counts y for parabolic image; 

-2 

Qq = 10 . Curves are indexed by parameters 

I defined in (3.47) and M ■ * * 132 

b 


vii 



Page 


Figure 19 


Figure 20 


Figure 21 


Figure 22 


Figure 23 


Average error probability P fi in binary bits 

detection vs, distance of separation 

(2 x q/cO between images for ideal detector, 

counting comparator, or simple detectors in the 

absence of background light; N - 0.1, 1, 2, 

8 

5 . . . 148 

Average error probability in binary bits 
detection vs. distance (2^/o) for ideal 
detection and simple detector with fixed 
area or optimum area in the absence of the 
background light; N = 5 . . . . . 156 

Average error probability P^ in binary 

detection vs. average number N . for simple 

s 

detector, counting comparator with square 

2 

observation area A’ = 4a at Rq = 1 and 

Xq/o = 2, N q = 0, 0.1, 0.5, 1 158 

Average error probability P in binary 

detection vs. average number N g for simple 

detector and counting comparator with square 

2 

observation area A 1 = 4a at Rq ** 1 and 

N Q - 0.1; x Q /a = 1, 2 160 

Average error probability in binary 

detection vs. average number difference AN for 

simple detector and counting comparator 

where AN = with, as the mean number 

of photoelectrons observed from a finite area of 

one-half of the surface under hypothesis for 

k-0, 1; n TQ = 0.5, 1, 2. . . . . , . 162 


viii 



Page 


Figure 24 Error probability P g of the counting com- 
parator vs. the length Rq for the rectangular 

observation area; N =5, The curves are 

s 

indexed by the parameter defined in (2.5?) • 168 


Figure 25 


Figure 26 ' 


Figure 27 


Average error probability P in binary 
bits detection vs. the ratio |e|/x^ due to the 
dislocation of the images for the ideal 
detector or the counting comparator at x q/° “ 1; 

N = 2, 5, 8. . . . . . . 

Average error probability in binary 
bits detection vs. the ratio |s|/xq due to 
the dislocation of the images for the ideal 
detector and the counting comparator, for 
fixed value N g = 5; x^/cr = 1.5, 2, 4. . , 

Saddle point s^ vs. the signal-to-noise 
ratio for a Gaussian image; N n = 0.5, 5, 

Q q - 10 10 . . 


174 


17 6 


203 




.ix 



Page 


List of Tables 


Table 1 
Table 2 
Table 3 
Table 4 


Optimum radius for simple detector. . . 

Optimum distance for simple detector . . . 

Percentage error of exponential distribution 
Percentage error of Poisson distribution . 


94 

179 

196 

196 



x 



Acknowledgements 


The author wishes to express her appreciation to Professor 
Carl W. Helstrom who introduced her to this field of study and 
provided the direction and encouragement throughout its development. 
Thanks are due to her many professors who provided the insight into 
the theories and techniques that make this dissertation possible. 
Appreciation is also extended to both Dottie Bybee and Sue Pettijohn 
who typed the manuscript. 

The author wishes to acknowledge the financial support 
provided by the National Aeronautics and Space Administration for 
the first three years and the National Science Foundation for the 
fourth year since 1969. This research was carried out under NASA 
Grant NGL 05-009-079 and NSF Grant GK-33811. 

Finally, the author’s deep appreciation is felt for her 
husband, Charles, for his encouragement and great consideration 
during these years of graduate study. 


xi 



VITA 



1962 - 

1963 - 1965 

1965 - 1967 
1967 - 1969 


M.S.E.E., Oklahoma State University, Stillwater 

Engineer, Fairchild Control Corporation, Los Angeles, 
California 

Project Engineer, Endevco Corporation, Pasadena, 
California 

Technical Staff Member, Bellcomm Inc., Washington , D.C. 


PUBLICATIONS 


Wang, L. L. Numerical calculation of cumulative probability from 
the moment-generating function. Proc . IEEE , vol. 60, pp. 1452- 
1453, November, 1972. 

Helstrom, C. W. and L. L. Wang. Optimum detection of an optical 
image on a photoelectric surface. IEEE Trans . on Aerospace and 
Electronics System (in press). 


FIELDS OF STUDY 


Major Field: Signal Processing 

Detection and Estimation Theory 
Communication Theory 
Electronic Engineering 


xii 


ABSTRACT OF THE DISSERTATION 

Detection and Estimation of an Optical 
Image by Photon-Counting Techniques 

v 

by 


Lily Lee Wang 

Doctor of Philosophy in Information and Computer Science 
\ s 

University of California, San Diego, 1973 
Professor Carl W. Helstrora, Chairman 


The statistical description of the photoelectric detector is 
given. The photosensitive surface of the detector is divided into 
many small areas, and the moment generating function of the 
photo-counting statistic is derived for 'large time-bandwidth product. 
The detection of a specified optical image in the presence of the 
background light by using the hypothesis test is discussed. The ideal 
detector based on the likelihood ratio from a set of numbers n of 


xiii 



photoelectrons ejected from many small areas of the photosensitive 
surface is studied and compared with the threshold detector and a 
simple detector which is based on the likelihood ratio by counting 
the total number of photoelectrons from a finite area of the surface. 
The intensity of the image is assumed to be Gaussian distributed 
spatially against the uniformly distributed background light. The 
numerical approximation by the method of steepest descent is used, 
and the calculations of the reliabilities for the detectors are 
carried out by the digital computer. An almost optimum intensity 
estimate is proposed when the intensity of the object light is 
unknown, and its statistical performance is studied. The application 
of the photon-counting techniques is further discussed where 
detectors are investigated for resolving two point sources with 
Gaussian images. 
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Introduction 


An optical communication system, owing to its high 
information-carrying capacity both in the temporal and the spatial 
channels, is more advantageous than electromagnetic wave systems of » 
lower frequency. In general, the retrieval of the information will 
rely heavily on prior knowledge about the field and the performance of 
the detecting system. Usually the optical field at space point and 
time t is not a measurable quantity, and only its average power at 
that point can be observed. A photoelectric detector, such as the 
photo-tube, the photomultiplier, the image tube, or the image orthicon, 
which carries the information about the optical field in the sense 
that the probability of the photoelectron emission is proportional to 
the intensity of the light at its photosensitive surface, plays an 
important role in the optical' system for the detection and extraction 
of the information. For example, in observational astronomy it is 
often a practice to use an image tube or a photomultiplier to detect 
a star for the purpose of observation and navigation; in a binary 
communication system a phototube may be used to detect a bit "l" when 
a specified optical object is received. 

In most applications, the background light, referred to as the 
"noise’', may also pass through the optical system and thus corrupt the 
object light, referred to as the "signal". When the signal-to-noise 
ratio is low, the observer will not only be required to detect the 
optical signal, but also to seek the best strategy he can to make the 
decision whether there is only the noise or whether the object signal 
is also present. One way to observe the optical field by using a 
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photoelectric detector is to count the number of ejected photoelectrons. 
That is, as the photosensitive surface of the detector is divided like 
a mosaic into many small equal areas, the set of data can be obtained 
from observing the numbers of the photoelectrons ejected from each 

* 

such area during the time interval (0, T) . Because of the stochastic 

nature of the optical field, each of the observed data will be a random 

variable. To make decisions, detection theory and the concepts of 

statistical hypothesis testing can be applied when the object signal is 

specified. If there are also some parameters of the object signal 

unknown, the estimation of such parameters as well as the detection 

must be carried out. In this thesis we will discuss the basic binary 

detection and single parameter estimation of an optical image and some 

of their applications in optical communication systems by using 

photon-counting techniques. 

A point source, at a known location, emits incoherent 

quasimonochroma tic light of given spectral density and is focused on 

the photosensitive surface of a photoelectric detector. Its 

electromagnetic field^\ which is spectrally pure, is assumed to be a 

stationary, spatio-temporal, circular Gaussian random process. A 

brief description of the image formation and statistical description 

of the photoelectric detector will be given In Chapter I. As the 

observation interval T is much greater than the reciprocal of the 

bandwidth W \ the distribution of the number of photoelectrons ejected 

from a single small area of the photosensitive surface, discussed by 
(2) (3) 

Mandel and Helstrom , will be derived in this chapter, following 

(A) 

Siegert , and it can be approximated by a Poisson distribution 
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function. The derivation will be then extended to the approximation 
that the numbers of photoelectrons ejected from the different small 
areas of the surface will be Poisson distributed and statistically 
independent from each other. The moment-generating function (m.g.f.) 
of the statistic, which is the sum of the number of photoelectrons 
ejected from many small areas as these areas become infinitesimally 
small, will also be discussed. 

When the optical signal is specified, the simple binary 
hypothesis tests can be used. Two important strategies, Bayes and 
Neyman-Pearson, will be briefly described in Chapter II. The optimum 
statistic based on the likelihood ratio and its ra.g.f. for the 

( 5 ) 

spatially varying optical signal, discussed previously by Heist rom »• 

will also be given. The performance of the ideal detector, in the 
absence of the background light, will be discussed , and its detection 
probability will be calculated for a preassigned false-alarm 
probability. When background light also passes through the system, 
only an approximate form of the distribution can be used. Because of 
the complexity of the m.g.f. of the statistic, a Gaussian approximation 
was suggested by Helstrom^. When the signal-to-noise ratio becomes 
large, the Kth order cumulant of the statistic increases as K increases, 
and the Gaussian approximation is not valid. Farrell ^ recommended 
the gamma distribution approximation. We have used the method of 
steepest descent to approximate the distribution with a uniform 
asymptotic expansion series following Rice as also discussed by 
Daniel v . The performance of the optimum detector has been 



investigated with a Gaussian image and uniform background light, where 
the detection probability at preassigned- values of false-alarm 
probability, and the average error probability will be calculated by 
using the digital computer. As the signal-to-noise ratio is small, 
the optimum detector can be approximated by the threshold detector and 
will be also discussed. Besides the ideal detector, which registers 
the locations of the ejected photoelectrons from the surface, a 
simple detector, which observes the total photoelectrons ejected from 
the surface without dividing it into many small areas, will be also 
discussed. The performances of both the threshold detector and the 
simple detector will be investigated with the Gaussian image and 
uniform background light. Comparison between these. three detectors 
will be made with the evaluation of the average error probabilities 
and of the detection probabilities at preassigned values of the 
false-alarm probabilities. 

When the intensity of the point source, which is located in 
the field of view, is unknown, the observer must also estimate the 
intensity of the image simultaneously if he makes the decision that 
the optical signal is present. Since the observed data are a 
set of random variables, no two experiments will yield the same value 
of the intensity estimate even though the true value of the intensity 
is the same in both. The most one can hope for is that the estimate 
will be close to the true value of the intensity in the sense of "on 
the average". Two most important strategies, Bayes and 
maximum-likelihood, will be discussed in Chapter HI. The intensity 
estimate, derived from the Bayes strategy, which has been discussed 
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also by Middleton and Esposito^^ , will be investigated with a quadratic 
cost function where the prior probability density function (p.d.f.) of 
the intensity will be assumed to be a gamma distribution function. 

When the signal-to-noise ratio is not too weak, the Bayes estimate of 

> 

intensity will be approximately equivalent" to the maximum-likelihood 
estimate. An almost optimum estimate therefore will be proposed, and 
its statistical performance will be investigated with a truncated 
Gaussian or a parabolic image. The expectations of the estimate will 
be calculated by the digital computer at different values of 
parameters such as the duration of the observation interval and the 
radius of observation area, all at a preassigned value of the 
false-alarm probability. 

In Chapter IV, we will further discuss some applications to 

optical communication of the photon-counting techniques. For example, 

\ 

the ideal detector derived from the likelihood ratio given in Chapter 
II can be applied to resolve two point sources with equal radiant 
power at known locations from a single point source with twice the 
power located between them. The m.g.f. of the ideal detector and its 
cumulants will be given. Since the m.g.f. of the ideal detector bears 
a complicated form with double integration, we will not practice the 
numerical calculation though it can be carried out by the digital 
computer. Instead, the ideal detector for detecting a bit "1" when 
light from a point source located at x is received or a bit "0” when 
a light from a point source located at -a: is received will be 
investigated numerically in the absence of any background light. A 
Gaussian image will be postulated, and the calculation of the error 



probability, which is dependent on the separation distance of the 
point sources, will be carried out. Two more detectors, the simple 
detector that counts the total number of photoelectrons ejected from 
half of the surface of the photoelectric detector and the counting 
comparator that compares the numbers of photoelectrons observed from 
the upper and lower half-surfaces of the photoelectric detector, will 
be also discussed, and their error probabilities will be evaluated. 

As the background light also passes through the aperture, the average 
error probabilities of both the simple detector and the counting 
comparator will be calculated with a finite square observation area. 
Since the performances of the detector for detecting binary bits will 
also depend on the prior knowledge about the locations of the images, 
the effects of mislocation on the ideal detector and counting 
comparator will be discussed. 
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Chapter I Image formation and photon-counting statistics 

In this chapter, the general terms such as intensity, co- 
variance function, and point-spread function in image forming and 
processing will be discussed. The optical field v will be considered 
as a complex circular Gaussian stochastic process which is stationary 
ergodic and spectrally pure. In other words, the process can be 
described by the complex random functions of space _r and time t, 
V^(r,t) and (£>£)> where V^(i:,t) and V^C^t) are the real and 
imaginary parts of the process. The statistical description of the 
photoelectric detector will be given. When the observation time T 
is much greater than the reciprocal of the bandwidth W \ the numbers 
of the photoelectrons ejected from different small areas of the 
photosensitive surface will be proved to be Poisson distributed and 
statistically independent of each other. The moment generating 
function (m.g.f.) of the photon-counting statistic will be derived 
in a general form and discussed. * 
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Image formation when diffraction theory is applied 

The diffraction theory of image formation has been largely 
used to describe imaging systems in which the process is considered 
both linear and spatially stationary. The basic elements for an 
imaging system are the luminous object o, aperture A (for example, 
a lens) and image I located in the planes U, A, R respectively as 
shown in Fig. 1, where z^ and are the distances of the object 
plane U and the image plane R from the aperture A. 



Fi'gure 1 Image formation system 

The optical field v at the point (u,z^) of the object plane at time 
t is assumed to be linearly polarized and quasimono chroma tic and can 
be characterized by 



» 


( 1 . 1 ) 



where 


V(u,z is t) - V 1 (u,z 1 ,t) + iV 2 (u,2 lf t) 


( 1 . 2 ) 


is the complex amplitude of the electric field and V^(u,z^,t), 

V 2 (u,z^,t) are its real and imaginary parts, which are Gaussian ran- 

( 1 ) 

dom processes . is the angular frequency at the center of the 

object spectrum. "R£" indicates the real part of the analytic signal 
V(u,z_,t) As the distances z^ and are much greater than the 

wavelength of the light, the complex amplitude V(^r,z 2 ,t) at point 
(r,z 2 ) of the image plane R at time t due to the object light can be 
expressed ^ aG 


V(r,z 2 ,t) = ^V(u,z 1 ,t) K(u»r) d 2 u 


( 1 . 3 ) 


where K(jJ,_r) is the amplitude point-spread function describing the 
propagation from the object plane to the image plane, o is the area 
of the object. 

The instantaneous intensity at any point (x,z) of an arbi- 
trary plane X away from the aperture with a distance z and at time t 
is defined by 


I(x,z,t) = V(x,z,t) V (x,z, t) f 


( 1 . 4 ) 


where V (x,z,t) is the complex conjugate of the amplitude VQc,z,t), 



which is not directly measurable; the observable quantity is the 

(3) 

ensemble average intensity defined by 

<I(x,z»t)> « <V(x,z f t) V*(x,z,t» . (1.5) 


When the process is also ergodic, the ensemble average is equivalent 

to the time average through the interval (-T,T) as T «>, 

The field is normalized so that the instantaneous power 

density at point (u,z^) of the object plane at time t is 
X 2 

l v 0d» z i ,t: )| > an< 3 t ^ e complex covariance function between the complex 
field at point (u_j,,z^) anc * the complex-conjugate field at point 
(Uj,z^) of the object plane at times t^ and respectively is defined 
by • 

Tte iS Uyt if t. : z ± j ^ ^<V(jx ±t z v t t ) V^(u j ,z 1> t j )> . (1.6) 


(1.6) is knovm as the complex autocovariance function^; it is pro- 
portional to the mutual coherence function with a factor 
ifl (t.-t.) , 

e 0 1 J * r * s t ^ e avera S e power density at point 

^i ,Z l^ °^^ ect pl ane along the direction z. For an optical 

) 

oystem that can be approximated by only considering the paraxial rays 
and has small bandwidth W such that W « the property of the 
spectrally pure light is characterized by Mandel^^ in such a way that 
the superposition of light beams will not affect the spectral distri- 
bution; the covariance function from (1.6) is then reducible to the 



product of two simpler functions as 


rCH i *iij»t: 1 ,t^:z 1 ) - <Ku i# u yZ^ xCt* - O , (1.7) 

> 

where <f> (u. ,.u. , z ) and x(tj “ t.) are the complex spatial and temporal 
1—1 **■ 3 3 

( o\ 

coherence functions^ . In general x( T ) I s normalized so that 

x(0) ~ 1 and x(' f ) = X (“ T ) for teal power spectrum (t is real). The 

ensemble average intensity from (1.5) can also be written as 


<l(x,z,t)> = 2r(x >( x, t ,t:z) , 


( 1 . 8 ) 


(9) 

which is the variance of the process V(x, z,t) or the sum' - of the 
variances of the two independent processes V^(x,z,t) and V 2 (x 5 z,t) 
at point (x,z) of the plane X and at time t. 

As the complex amplitude V(u,z^,t) is defined to be zero out- 
side the object o, (1.3) can also be expressed by a convolution 
(3) 

equation for the paraxial approximation t that is 


V(r # z 2 ,t) = 



(1.9) 


The average intensity at point (r^z^ of the image plane due to the 
object light according to (1.5) is 



13 


co _ co 


—co — oo 


co _ co 


d 2 u 



d 2 U 


—co “—co 




(%».“ »t,t:z 1 ) K(r-u^ K (r-u ) 

J 


( 1 . 10 ) 


when (1.6) and (1.9) are used. rO^ ,u. ,t ,t :z^) is the covariance 
function between points (u^,z^) and (u^,z^) of the object plane at 
time t. As the light is assumed to be spectrally pure, the average 
energy received within the small area AA of the image plane because 
of the object illumination during the observation interval (0,T) is 
given by 


AE - / d Z r j dt r(r,r > .t,t:z 2 ) 

Aa j o 


,m /.to 


« yQ OO 


d 2 r 


AA 


J J 1 1 


K(r-H ± ) K (r-Uj) 


( 1 . 11 ) 


as (1.6), (1.7) and (1.9) are used. 


» 



Intensity representation of light from an Incoherent object 

The average intensity at point (jc,z^) of the image plane and 
at time t due to the coherent object illumination can be expressed by - 
(1.10) or can also be written as 

fO CO 

< " I im < i >z 2 jK(r-u.) VCu^z^t) d^f 2 ) . (1.12) 

-CO -co 


For an incoherent object, the spatial coherence function from (1.7) 

/ o\ 

can be defined v } as ' 



(1.13) 


where B is the radiance so defined that B(ji^)/4ir is the power emit- 
ted per unit area per steradian in the direction z normal to the object 
plane. The radiance B(u^') is of limited extent. Here k Q = Q o /0 o with 
C Q the velocity of the light, and 6 (u ) is the two-dimensional Dirac 
delta function. The average intensity given by (1.10) can be 
expressed in terms of the radiance B(u^) as 


< I im (£.!! 2 » t » = 2lrk o 2 


CO CO CO CO 


-CO —CO -CO - CO 


Jjj KCr-v^) K (r-Uj) 



/ CD -CO 

J S{t-u ± ) d 2 ^ 


(1.14) 


-CO -CO 
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o’. 

where S(rMa^) = | is defined as the incoherent point-spread 

function. Thus we can see from (1.12) that for a coherent object 
illumination the imaging system is linear in complex amplitude. 
However, for an incoherent object the imaging system is linear only 
in intensity as given by (1.14). 



The statistical description of a photoelectric detecto r 

. When the photoelectric detector is used for image detection, 
the photosensitive surface — for example, the surface of a photo- 
multiplier — will be placed at the image plane. The output of the 

* / 

illuminated photodetector carries information about the radiation 
field in the sense that the instantaneous probability of the photo- 
electron emission is proportional to the classical instantaneous 
intensity of the light as long as the intensity of the 

light is not so strong that the photoelectron emission cannot be 
described by the first-order perturbation theory. The output of the 
detector is a sequence {t^J of time-instants of absorption-emission 
phenomena. These instants [t^3 are random and constitute a point 
process. If we observe this process for a fixed time interval 
[t,t+T], a total number n of photoelectrons will be ejected from the 
area A of the photosensitive surface. The sequence ... , 

of instants when the photoelectrons are emitted can be written 
symbolically as ft^}. When the total energy received from that area 
A during the interval [t,t+T] is given, the probability that n 
photoelectrons are ejected is a Poisson function: 

' P(n|A T (t)) = (X T (t)) n exp(-X T <t)) / n! f "(1.15) 

where 


16 
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J. T (t) - a f f IvOr.t^l 2 d t]L d 2 r . U- 16 > 

A t 

is the average number of photoelectrons and can be predicted if the 

s 

coefficient a is given, a is proportional directly to the quantum 
efficiency and inversely to the energy hQ/2ir of each quantum of light, 
where h is the Planck’s constant and B is the angular frequency of 
the field. V(r, t) is the complex amplitude of the light field at the 
point j: of the photosensitive surface at time t. Since V(.r, t) is a 
stochastic process, so is the number A^(t) . The probability for n 
photoelectrons in any counting interval [t,t+T] can then be expressed 
only by the ensemble average^ 10 ”* of the conditional probability 
p(n|* T (t))- That is, 

P n (T f t) = <(X T (t)) n exp(-A T (t)) / n!> . (1.17) 

The derivation of (1,15) is well known and will not be given 

here. The operation of the expectation on the probability from (1.17) 
makes it depart from the Poisson distribution. For a stationary 
process, the ensemble average is independent of the initial time t. 

We will use the expression T q (T) to replace P^(T,t) in (1.17) from 
now on. 


/ 



Distribution of the photoelectrons from a single small area when the 
time -bandwidth product TW >> 1 

When linearly polarized light impinges normally on the photo- 
sensitive surface, the probability that a number n^ photoelectrons 
are emitted from a small area dA^ centered at point during an 
interval (0,T) can be written according to (1.17) as 


where 


P n = ^ X i 1 exp(_X d ! n i ! ) 



A i " “ J / /V(r,t)| Z dt d 2 r 
0 J ^A, 



a/ |v(r,t)| 2 dt d 2 r 


0 A 


with f^(.r) as the function defined by 


(1.18) 


( 1 . 1 $) 


f ± (r) - 1 
- 0 


r e dA^ 

r i dA, 
1 


( 1 . 20 ) 


A is the entire area of the surface. A^ depends on the coefficient 
a, the size dA^ and the amplitude |v(r,t)| e dA_^. The distribution 
of the random variable n_^ can also be derived from its moment 
generating function (m. g. f .), which is the ensemble average of the 
conditional m.g.f, with given A^. That is , 
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h ± (s) - <^E[exp (sn_^) | \ ± S) 


»• n -X , 

ex v( sn ±) x i e ^ n i i / > 


n i= 0 


(exp[X i (e S ~l)] > 


( 1 . 21 ) 


Now 


i 

T /-T 


(e S -l) = a f f f . (x) (e S -l) (v(x,t)| 2 dt d 2 x 
0 A 1 




2 ,2 


0 0 A A 


where the kernel 


K 1 (Xi l x 2 :t i > t 2 ) V fe 2 ,t 2 ) d tl dt 2 d x x d x 2 i 


( 1 . 22 ) 


K i (x 1 ,x 2 :t 1> t 2 ) » F i (s:x 1 ) «(x x -x 2 ) <5 (ti-ta) 


(1.23) 


and 

F i < s: x 1 ) = a (e fc -l) f i (x 1 > . 


(1.24) 


The ro.g.f. from (1.21) can be worked out as in the problem of finding 
the m.g.f. of a quadratic functional of a circular Gaussian random 
process^* 13 ^ . One can show that h^s) can be determined from the 


expression 
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h^s) “ 



L(x,t,x,t: 


-u) dt d 



* 


(1.25) 


(4 14) 

vjhcre k(x^»t *x ,t 2 :u) can be solved * from the following integral 
equation 




L fe L ,t l*- S 2* t 2 :U ) + 2u 



0 0 A A 


^X x &\ 2 dh l dh 2 



* ^C^rW r (Xii^2* h l ,ll 2^ L ^2 ,h 2^2* t 2 :u ^ 
Ki&L^rVV r( ^1^2’ h l’ t 2 ) d \ dh l . <1,26) 


For spectrally pure light, the autocovariance function can be written 
according to (1.7) as 

rfes li x 2? t 1 ,t 2 ) = X< t i- t 2 ) . (1.27) 


Thus , substituting (1.23), (1.24) and (1.27) into (1.26) and solving 
for L(x^, t^ j^x^, t 2 :u) recursively we have 



= 2F i (s:x 1 ) <|>(x lt x 2 ) X (t 1 “t 2 ) 

d \ 2 dh 2 F i (s: % ) x(t l“ h 2* 


f ±(s : ^ 2 ) ^(x 2 »2S 2 ) xO» 2 -t 2 ) 



2 3 

+ ti 2 J 


0 0 A A 


2 2 

d X 2 d X 3 dh 2 dh 3 F i( s: 2Si> 


*(£ 2 .£ 3 ) X< h 2 " h 3 ) V 61 ^ 1 Kx 3 .S 2 > X (h 3 " t 2 ) 


"1“ • * « i * 


(1.28) 


By substituting (1,28) into (1,25) we can write the logarithm of the 
m.g.f. as - 


£n h^ (s) = 2x(0) T J F^six) $(x,x) d x 



+ | ( 2 T ) 2 j J F i (s:x) F ± (s:^ 1 ) ^(x,^) K^.x) d ^ 

A A 
/*T f T 

#T ~ 2 J J xCt-hJ xOi.-t) dt dh 
0 0 A . .. 

/ 

+ I (2T)3 J J J F i< s: ^s) V 8 ^) F i (s: *^2 ) ♦<X 2 »S> 

A A A 

/•T /* T /*T 

•d 2 x d 2 ^ d 2 ^ T ~ 3 J Q J Q J Q x(t_h 1 ) X(h 1 -h 2 ) x(h 2 -t) dtdh^ 




+ - • ‘ \ <2T)A •{ • -J V 5 ^) • • F i ( s: ^_ 1 >'t'Ci.i 1 )4'<i ll 2 2 ) 

A A 


*^£-3.*^ d2 & d2 Xi 


• - d Vi rl l ■■{ x(t 'V • • x(h £-r t)dtdh i 


• * d h^ i "^” * * * * 


“ 2 x(0) T y F. (s:x,) <l>(x,x) d £ 

A 
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* ? f j ( 2T) ^ J . * J F i (s:x)F i (s:x 1 ) . . F i (s tej.0^ 

A A 


i>2 


2 2 2 

* * * (£*_!*£) d ^ d ^. x * * d X A «i * P A 


(1.29) 


where 


T /»T 


P, - T 


rt 



. . . dh. 


{ J 0 * *i 0 x(h r h 2 )x(h 2 -h 3 ) . . x(V h i> dh i dh i • • ; 

£ ^ 2 . (1.30) 

Nov; the bandwidth W is defined^"*^ by 


W - |x(0)| 2 [ j |x(t)f 2 dt]- 1 = [ J |x(«)| 2 dfl/air]" 1 (1.31) 


where 


as X (0) = 1 , 


x(fi) 


, . -ifJt j 
X(t) e dt 


(1.32) 


is the temporal spectral density of the complex field at the image 
plane and is real so that x("0 - X (-t) . We now let 
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h ~ li b t 
£-1 £ £-1 , 


h„ - h. 


'£ “1 3 ” (t l + t 2 * * * ' 


(1.33) 


Mien T >> rr we can extend the range of integration to (-«<») and by 
W , ‘ 

changing variables we can write (1.30) as 
-£4-1 


CO 00 4^ CO 


p £ - T 


.-CO —03 —00 


x(t 1 )x(t 2 ) * * x( " t l“ t 2* ,t £-l )dt l dt 2* ,dt £-l 


According to the convolution theorem we have 


X (t^) x(-t 1 -t 2 . * • -t A _j) dt. 


(1.34) 


x 2 (ft) exp[-ifi(t 2 +t 2 + • • - dft/2ir . 


(1.35) 


By substituting (1.31), (1.32) and (1.35) into (1.34) we have 

,-£4-1 i 


P l" T 


2, n * ^ 

x <n) 2? 


■ I 


x(t 2 ^ ••• x(£j£_x^ 


exp[-ifi(t 2 4-t 3 . . . t^)] dt 2 . . . d t £ _ 1 


-£+l 


/ 


x*(fl) dG/2ir 


( wr 1+1 r x 


(1.36) 
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where 


x l (n) 




As WT >> 1, we can approximately write x(ft) ~ — for ft e (-ttWjTtW) 

w 

and x(ft) - 0 otherwise. Thus Rj ~ 1 is finite and P^ ■+ 0 for 
Z ^ 2. For spatially slowly varying light and small area dA^ we 
define 


“ 2aT 


4>(jc, 5:) d x 


= 2aT dA ± , 


as the average number of photoelectrons ejected from the small area 
dA^ centered as^c^. Since for the spatial coherence function 


=■!>(%»%), for all i ^ j , 


I J J. • • F i (s:x)<f»(x > i 1 )F i (s:i 1 )<K Xl> x 2 ) 




| f F i (s:x)4>(x,x ) \ l . 


* 1-1 


Thus as long as n^^ defined by (1.38) is not too large, all the terms 



in (1.29) for l ^ 2 are negligible by comparing with the first term 
as WT » 1. That is 

£n hj.Cs) £* n^(e 8 -l) , (1.40) 

which is the logarithm of the m.g.f. of a Poisson distribution^^ 
with mean n^. 



Pis Crib ut: ion of the photoelectrons from two small areas for TW >> 1 


When the energy received by each of the two email areas dA^ 

and dA 2 centered at and x: 2 is given, the numbers of photoelectrons 

n, and n„ ejected from these areas during the time interval [ t , ti-'l] 

1 • 2 

are Poisson distributed with the given mean values X^ for i = 1, 2 
defined in (1.19) and are statistically independent. The joint m.g.f 
of the random variables n^ and n 2 can be derived from the expected 
value of the conditional m.g.f.: • 

h 12 (s l’ s 2 ) 


where now 

b e 

X 1 (e -1) + X 2 (e -1) 

f 

JJ 8 g • 

dt J d 2 x[f (x)(e 1 -l)+f 2 (x)(e 2 -l)]fv(x, t) | 2 
0 A 

2 2 A 

dt x dt 2 d x x d x 2 K 12 (x 1 ,x 2 :t 1 ,t 2 )V(x 1 ,t 1 )V fe 2 > t 2 ) 




G„n,+S~n 


= <E[e 1122 |X i> i-l f 2]) 


^exp[X^(e *-l) + X 2 <e ^ , (1*41) 


(1.42) 


where f^(x) for i - 1 or 2 is given by (1.20) and 
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K 12^1^2 :t l’ t 2 ) = F U (, 1 ' , 2% )l W (t l' t 2 1 ' (1<43) 


with 


8 - ^2 

F 12 ( s 1 » s 2 : -^i^ = af i^l )(e “D +af 2^1^ e 


(1.44) 


The tn.g.f. can also be expressed in the form of (1.25) and the 
following integral equation must be satisfied. 

L 12 < ^l’ t l’~2’ t 2 :u)+2u f f F 12 ( s l’ s 2~l ) ' , ’^l’ t l l2 2 ,h 2 ) 

0 A 
2 


■ F 12^ s 4. ,s 2 5 *l^^l ,t l ,, *2 ,t 2^ . 


(1.45) 


The kernel L ^2^1* t l*^2 ,t 2 :u ^ * S s0 ^ ve< * ^ or recursively and substi- 
tuted into the joint m.g.f. as described before where t l» t 2 ) 

and F^(syc^) in (1.23) and (1.24) are now replaced by (x^ , : t^, t 2 ) 
and F^ 2 ( s j > s 2 ). We have - 


in h i2( s i»s 2 ) = 2 x( 0 )T J F 12 (s 1 ,s 2 :2£)<|) (x,x) d x 

A 


+ | (2T) 2 / J d 2 xd 2 Xl F 12 <s 1 .= 2 : ^F 12 (s 1 , S2 :j; 1 ) 
A A 
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1 8 

+ - (2Tr 



F 12 (s r s 2 :x) F 12 (s 1 ,« 2 «i) 


AAA 


F 12 ( VV^2 ) *^l ) ^l di 2 ) ' l 'k : 2 , ~ ) P 3 


+....+ 7 (2T)* 


d V 2 Xx • • d X s ,_i F 12 (s 1’ s 2 


F 12 (si > s 2 :i 1 ) • • • F 12 (s 1 ,s 2 :i Jl _ 1 )<(iCx,x 1 )'(>(x 1 tJ 2 ) • • ^(ix-1’^ P j> 


+ * « 


(1.46) 


where P. for £ £ 2 is given by (1.30). By the same argument „ that is, 

Xj 

when the average numbers defined in (1.38) for i *= 1, 2 are not too 
large and WT >> 1, (1.46) can be approximated by the first term or 


fcn. ^22^ S 1 ,S 2)“ 2T 
s 

2£ 2aTdA 1 $( 2 ^,.^) (e 



2 

F l2 (s I ,S 2 : ^^^ d 

S 2 

+ 2aTdA 2 <(>(x 2fc x 2 )(e -1) 


8 S 

« nj(e 1 -1) + n 2 (e 2 -l) . (1.47) 

- S i 

Since £n h^(s_^) = n^(e -1) for i = 1, 2 as given by (1.40), this 
implies that the joint m.g.f. ^^2^ s l‘ s 2^ tlie P roc * uct m >g*f* ,s 


^1^ S 1^ and ^2^ s 2^ as * n ot ^ cr wor< ^ s » the number of 
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photoelectrons ejected from areas dA^ and dA^ during any observation 
interval are Poisson distributed and statistically independent of 
each other when the time-bandwidth product is large. 



Distribution of the sum of the photoelectrons from a large number of 


email areas when WT >> 1 


When there are a large number N of small areas dA_^ for 
.1 = 1, 2 , . . N, the distribution of the sum of the photoelectrons 
from these areas can be derived in the same way just described as for 


N - 2. For generalization, we discuss the distribution of a statistic 
given by 



(1.48) 


where is the center of each small area dA^ where n^ photoelectrons 
are ejected, is a weighting function to characterize the 

location of n^. For example, when = 1, Y Is the sum of the 

photoelectrons ejected from the N small areas. As N is very large 
we will then pass to the limit as dA^ 0, 

The m.g.f. of Y can be written as - 

N 

h v (s) = (E[exp s 1C n | ^]) 

Y i=l 



(1.49) 


where 
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• dt l dt 2 d2 ~l d2 ~2 


Now 


tilth 


v* e3 fe± ) 

V s ^1> = « 2j (e ■ -1) ^fex) , 


and f^CSj) I s defined in (1.20), 

We crust first solve the integral equation 


L 0 < ^l ,t l , ~2' t 2 :u) + 2u 



I N < s .«!> ‘f’ (Si ° : 2 ) X (t 2 _h 2 ) 


(1.50) 


(1.51) 


(1.52) 


0 A 
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Now 


* ^0 ^2 * k.2 ’~2 * t 2 ;u ) d X2 dh 2 


2 . 


(1.53) 




*-N 


- a X) (e e ^ ) -D- /* <i>(x,X> d2 X 

i=l •'dA. 


L 

i*l 


_ s$(x.) 

^ a 2-# (e " 1 ) ♦fe 1 »X> dA i » 


(1.54) 


as I s slowly varying spatially. When N is very large we can 

pass the limit as dA^ •*■ 0. (1.54) can be written as 


'A 


-£ n ( s >£) d X 


s3(x)_. 


a § (e 1) d x 


(1.55) 


The logarithm of the m.g.f. can be obtained when the kernel 

^0^1* t i^2 ,t l !u ^ sodved first recursively from (1.53) where (1.54) 

is substituted. We have 


^0^1* ^1 ^2 * ^2 ‘ 
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+ 2 3 (“u) 2 ^(s,^) J j f d h 2 dh 3 d 2 i2 d 2 ^ 


83 (x 2 ) s$<£ ) 

(e * -1 ) < e "D ♦feiJLZ 2 ^ ^2 ^ 


X(t r h 2 ) X(h 2 -h 3 ) x0» 3 “'t 2 ) 




(1.56) 


The logarithm of the m.g.f. can be derived as we pass to the limit 
dA^ -> 0 where (1*55) and (1.56) are used. 


£n h Y (s) 



I du 

f 0 ^0 "A 

^ s3(x)_ 


Lq(x, t ,_x>ti~u) dt d x 


~ 2ax(0)T j ( e °’ J ^-1) ^.x) d jc 

'a 


+ I (2aT)2 J A l d *l d *2 


S 3(x, 1 ) ^(x 2 ) 

(e -1) (e -1) 4 '(x: 1 »X2 ) 


+ J (2aT) & 



A •'A 


.2 A 2 

d d *2 


P 2 + 


‘ d \ 
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• [TT (e SB(il) -l)] 


(1.57) 


where P for l ^ 2 is given by (1.30). As WT >> 1, P -*■ 0 for all 
x. * 

£ ^ 2, and by the same argument used in (1.39), all the terms in 
(1.57) except the first term become very small. Then we can 
approximate 


In h Y (s) « 2ctT jf (e® 3 ^-!) <|>(x,x) d 2 x , 


(1.58) 


or the m.g.f. of the statistic Y can be written approximately 


h y (s) ~ exp 


[“1 


(e 83 ^~l) *(x 


y.x) ] f 


(1.59) 


where the mean and variance in general are 


E[Y] * 2aT l 8(x) $(x,x) d x > 

J k 

Var[Y] = 2aT J 6 2 (x) . 


(1.60) 


For 3(x) = 1, (1.59) becomes 


h (s) - exp 


j^2aT ^ 4>(x,x.) d 2 ^c (e S -l) J 


(1.61) 



which is the m.g.f. of a Poisson distribution with equal cumulants 


2aT S $(x, x) d x, for all orders. 

*A ~~ ~ 

On the other hand, as we expand 




(s(3 (x) ) m / ml , 


m*=l 


(1.62) 


if 


4 / e ra (x) d > <<: 


m 


* ‘'A 


~ J tv d2 2S 


for m ^ 3 


(1.63) 


we can further approximate (1.59) as 


h y (s) - exp 


[ 2aT J s 6(x 


fe) d x 


+ 2aT ^ s 2 6 2 (x) 0s*^> d 2 ^ j 


(1.64) 


which is the m.g.f. of a Gaussian distribution. 
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Chapter II 


Binary detection of an optical signal at a 
photosensitive surface 

Two important criteria, Bayes and Neyman-Pearson , used in the 
simple hypothesis test for binary detection will be briefly described. 
The optical signal is detected by using the photoelectric detector. 

The photosensitive surface of the device is divided like a mosaic 
into many small equal areas so that the observed data n. are the set 
of numbers of photoelectrons ejected from all the small areas. The 
optimum statistic^ obtained from the likelihood ratio in the presence 
of the uniform background light will be discussed and its moment- 
generating function (m.g.f.) will be derived. Because of the com- 
plexity of the m.g.f., the distribution of the optimum statistic will 

( 2 ) 

be approximated by the method of steepest descent and compared with 
two other approximate forms, the Gaussian and gamma distributions. The 
detection of the optical signal in the absence of the background light 
will also be discussed. Two other detectors, the threshold and the 
simple detector, will be studied. In order to investigate the per- 
formances of these detectors, a Gaussian image will be postulated and 
the false-alarm probability, the detection probability, and the average 
error probability will be calculated. 
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Binary hypothesis test 

In a communication system when a set of data b_ = - • .b^3 

(3 4) 

Is observed, the decision between binary hypotheses * and 

where 11 q is the hypothesis of background noise alone and is the 

hypothesis that a signal is also present, can be viewed as a rule for 
dividing the total N-dimensional observation space E of the data Jb 
into two regions, E^ and E^. Hypothesis will be chosen if the 
observed data b_ fall in the region E^; will be chosen otherwise. 

The joint conditional probability density function (p.d.f.) P(b|H^) 

under for k - 0 or 1 is to be given t and the likelihood ratio defined 

ky 


rCbJV 
A(y = nEK) 


( 2 . 1 ) 


will be calculated. If the Bayes criterion is used, a decision level 
defined by 


« C 10 " c oo^ 

(1 -mc 01 - c n j 


( 2 . 2 ) 


can be evaluated, where the prior probability £ of hypothesis H Q and 
the costs C ±j of choosing hypothesis when happens to be true 

(I, j - 0,1) , are given. The regions Eq and E^ are divided in such a 
way that I Q contains all the data for A(b) £ A Q and for A(b) > A Q . 
The false- alarm probability and the detection probability are defined 
as 
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Q - Pr [A(b) > A 0 JH 0 ] » f P(b|H 0 )db , 

J h 

= Pr [A(b) > = J" • (2.3) 

• 1 

When there is no information about the prior probability £ and the cost 
functions, the appropriate way to make the decision according to the 
Neyman-Pear s on criterion is that the false-alarm probability Qq will 
not exceed the preassigned value. The decision level Aq must be solved 
from the expression of Qq by (2.3), and the hypothesis or Hq will be 
selected according as A(b) > Aq or A(b) £ Aq. 

If £ is a set of discrete random variables, the likelihood ratio 
A(b) defined in (2.1) will be the ratio of the probabilities P(b (H^) 
and P(b(H 0 ). When the Neyman-Pear son criterion is used, randomiza- 
tion^ must be applied. That is, a probability f of choosing must 
be assigned at A(b) = Aq so that the false-alarm probability 

Q q = Pr[A(b)>A 0 |H 0 ] + f Pr[A(b)=A 0 |H Q ] 

= Ptt lH„) + fP(b:A(b)=A n |H n ) (2. A) 

A(b)>A 0 V ’ 1 0 0 0 

equals exactly the preassigned value. For given Qq, both Aq and f can 
be solved from (2.4) and will be discussed later for Poisson distribu- 
tion. Hypothesis or Hq will be chosen for A(b) > Aq or A(b) < Aq. 
The detection probability for the correct decision can be evaluated in 
the same way after the decision level Aq and probability f are 
obtained. That is , • 



AO 


Q d " A (fel P< ^i H 1 ) ■+ fPCtsACb) - A 0 | h i } # 


(2.5) 


The average error probability for each decision is defined by 


P e = 5Q 0 + (1-C) (1-Q d ) 


( 2 . 6 ) 


If the relative costs and given in (2.2) are equal, 

the hypothesis with greater posterior probability PCH^j^) for k = 0 

or 1 is always selected when the Bayes criterion is used and the error 

(3) 

probability P^ given by (2.6) is minimized 

The reliability and error probability are used to measure the • 
performance of the detectors. One detector is said to be better than 
the other detector if for a fixed value the former has higher de- 
tection probability and hence smaller error probability. 

The operation of the hypothesis test for binary detection can 
best be described by the block diagram on the next page. The selection 
of the Bayes or the Neyman-Pe arson criterion depends on the information 
about the system we have and its application. The block labeled 
"computer" implies that certain numerical calculations will be involved 
so that the decision level Aq, the reliability { Qq , Q^) or the error 
probability P g can be calculated. 
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Figure 2 Block diagram of hypothesis test 
for binary detection. 








The Ideal photoelectric detector 


When a luminous object is located in the field of view of an 
optical detector and focused on the photosensitive surface, the 
photoelectrons emitted from the surface will be proportional to the 
intensity of the object light, which is a fluctuating function of 
time as discussed in Chapter I. If there is also background light 
incident through the aperture, the information carried by the object 
light will be corrupted. As the intensity of the object light is not 
much stronger than the background light, in order to decide whether 
a certain luminous object is present or not, we can use strategies 
according to the hypothesis tests which we have just discussed. The 
basic elements of the optical detector have been shown in Fig. 1, 
where the photoelectrical emission surface of the photo-tube is 
placed in the image plane and is divided like a mosaic with a large 
number N of small equal areas dA^ as shown in Fig. 3. 



Figure 3 


Photoelectrical emission surface 
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The received data, are the values .n - f n i> n 2 * * * obtained by 

measuring the numbers n ± of photoelectrons ejected from each area 
dA « dA for i = 1, 2 . • . N during the interval (0,T). Since the 
background light in general also enters the aperture, the observer 
must choose between the two hypotheses:- (H Q ) only background light is . 
present , or (H ) object light is also incident through the aperture in 
“addition to the background light. When the observation time T is much 
greater than W" 1 , the numbers n ± of photoelectrons emitted from each 
area dA^ under hypothesis (k = 0, 1) are Poisson distributed and 

statistically independent of each other as discussed in Chapter I. 

The probability is given by 


r n i ~ n ik 


\ (T IV * n ik e ' V 


(2.7) 


where n is the average number of photoelectrons emitted during the 

ik 

interval (0,T) for k = 0, 1 from the ith area dA i and 


ctTdA I, 


k - 0, 1 


( 2 . 8 ) 


with 


^fci) 88 2 ^k^i^i^ * k = 0, 1 


(2.9) 


when (1.38) is used. . ) is the spatial coherence function under 

k i j 

hypothesis for k ■ 0, 1 at point and pointy, ^k^i^ the 



mean intensity at point x^ The joint conditional probability for 
observing the data n_ under hypothesis for k - 0 or 1 is 


v t iv -]"[ v (t iv 

** i=l i 

- TT "ik 1 e ik ' v . (2,io) 


The likelihood ratio defined by (2.1) becomes 



( 2 . 11 ) 


and will be compared with the decision level A^ according to the Bayes 
strategy or Neyman-Pearson strategy. Hypothesis 11^ will be chosen if 
A{n) > Ag^and will be chosen otherwise. It is also equivalent to use 
the optimum statistic 



( 2 . 12 ) 


since AfnJ is a monotone function of g which is obtained from the 
logarithm of the likelihood ratio A[nJ. The decision level of 
this statistic is given by 
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g 0 « £n A q + AN , 


(2.13) 


where 


AN 



n iO> 


(2.14) 


is the total average number of photoelectrons emitted over the entire 
area of the surface when the signal field alone is present. Thus we 
can compare g with instead and select if g > Sq and when 
g £ g . When Neyraan-Pearson criterion is used, the false-alarm 
probability is preassigned. The distribution of the statistic g 
under must be known so that the decision level can be determined 
from either of the expressions in (2.3) or (2.4) as discussed pre- 
viously. To examine the reliability or the error probability of making 
'each decision we must also know the distribution of the statistic g. 

c 

The mean intensity I^(x i ) given in (2.9) can be written as 


I k ( % ) = I o (£ i ) + k - 

k = 0, 1 


(2.15) 


where l^(x^) is the mean intensity due to the background light. In 
most cases, such as thermal background light, I^(x^) can be assumed to 
be spatially invariant for all areas dA. over the surface. I (x . ) Is 
the mean intensity due to the object light in the area dA^. Therefore 



we can write 


W “ T b • for a11 dA i * 


I B (x±) = Ig yC^) , i = 1, 2 . . . N 


v (2.16) 


where I = I I (x) d^x is the total power transmitted to the 
. J k 8 

surface from the object and yC^) is the weighting function to 
characterize the distribution of the power at point „x_. . A is the 
entire area which is divided into N equal areas dA. • 

We also define a function 


H (%) " n iX / n i0 

° [IqOS^) + t ^O^i^ 

- 1 + x s YCx,) / ^ 

1 + D 2 u(x ) (2.17) 

^ » 

2 ' 

where D is called the signal-to-noise and is the ratio of the total 
average number cxTI Q of photoelectrons ejected from the surface by the 

J 

object light illumination to the average number aTl^A^ of photoelectrons 
ejected from a finite area A q of the surface by the background light 
illumination. The function u(x^) is only different from yQ^) by the 
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factor A Q ,that is u^) = A^C^). A q is arbitrarily defined, 

otatistic g from (2.12) can be written as 


g « Z-/ n An H(x ) (2.18) 

i«l 

which is similar to the statistic Y of (1.48) with 8(x ) « An H(x^) 

as discussed in Chapter I. As N is very large, by passing to the limit 
as dA + 0, the m.g.f. of the statistic g under hypothesis as TW >> 1 
can be written according to (1.59) as 


V s) = E [ e ' sg l\] 


[cT jf (e- slnH ^-x) I k Cx) d 2 x] 

[cxTIh J H k (x) [h _S (x)-i] d 2 x J , 


k = 0, 1 


The logarithm of h^s) 


in h^s) « ciTI^ J H k (x) [h S (x)-lJ d 2 x , 


for k «* 0, 1 


The distribution of the statistic g .cannot in general be obtained in 



exact form. Only an approximation can be. used and will be discussed 

later. 

The logarithm of h^s) from (2.20) is differentiable for any 
order with respect to the variable s,and the nth cumulant under hypo- 
thesis for lc = 0, 1 can be expressed as 


C ■ <-« n £ to V 8 > 

ds 


s-0 


**- ctTI^ J H k (x) ln n H(x) 


( 2 . 21 ) 


In particular, the mean and variance of the statistic g can be cal- 
culated by - 


E[g|\] - “TI b f Hk <^) 11 d £ 

J A 


Var[g|H^] - aT hJ A d ^S 


k - 0, 1 


( 2 . 22 ) 


If the average intensity I (x ) given in (2.16) is zero outside a 

6 3. 

finite area of the photosensitive surface, that is, if 

= 1 s ( ^i ) + for -^i c A i and , = I o^i ) otherwise > 

then H(x^) = 1 when jx^ i and the optimum statistic g involves only 

the emission within the area A^, so that there wj.ll be a finite proba- 
bility of getting no electrons under hypothesis H^, 




which is the result of (2.19) as s ■+■ 

When is also spatially invariant, so is the function 

H(x ) . The optimum statistic is therefore equivalent to g/£n C, where 
C = H(x.) is constant for all i, and the statistic can be written as 


g 



(2.24) 


which is the sum of the photoelectrons from the total number N of small 
areas dA. As TW >> 1, the m.g.f. of the statistic g' is given by (1.59) 
with $(x) “ 1 as discussed in Chapter I. g' is therefore a random 
variable with a Poisson distribution. 

The detector just described represents a particular way of 
processing the light field behind. the aperture of the optical system 
in order to choose between the hypotheses. It has been compared with 
the optimum means of processing the field from the point of view of 
quantum detection theory When diffraction is negligible and the 


surface has unit quantum efficiency, this detector is equivalent to 
the optimum detector of the image-forming light 



Threshold detector 


The structure of the optimum statistic is based on the 

2 

function H(x) which is a function of D u(x ) as given by (2.17). When 

2 

the signal-to-noise ratio D is not known, the detector must be 

2 

designed for some reasonable standard value. If D is very small, the 

appropriate thing to do in detection theory is to expand the logarithm 

of the conditional likelihood ratio A[njD ] from (2.11) into a power 
2 

series in D', and the optimum statistic is equivalent to the threshold 

2 

statistic, which is based on the lowest order of D in that expan- 
sion^. For detecting the image on the photoelectric surface we have 

just discussed, the optimum statistic g from (2.18) can be expanded 

2 

into a power series in D or 

N 

n, 2 <-l) m “ 1 [D 2 u(x )]” / m! . (2.25) 

' i-1 1 m=l 

2 

Since D is very small and is independent of the location, we can base 

the decision on the threshold statistic, which is the coefficient of 

2 

the lowest order of D in (2.25). We have 

N 

g *= 23 n u(x ) (2.26) 

0 1=1 1 1 * 

The m.g.f. of this statistic can be derived for TW >> 1 and is given by 
(1.59) where ^(x^) = u (x^)- That is when N is very large. By passing 
to the limit as dA *> 0 we have 
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h 0k (s) ° E e 


** exp { ctT I. 


f ..k, N f -su(x) - ,2 

I II (x) e w -1 d _x 


k *= 0„ 1 , 


where H(x) and u(x) are given in (2.17), and the logarithm of (2.27) is 


*n h ek (s) -■ «T I H (x) 


/ H k (x) 


for k B 0, 1, 


(2.28) 


The nth cumulant of the threshold statistic g 0 under 


can be written as 


G ek n = ( - 1)n * nh ek (s) s =o 


“ <xT I / H k (x) u"(x) d 2 x 
* A 


for k - 0, 1 » 


and the mean and variance are 


E[g 0 |\] «= X^J H k (x) u(x) d 2 x , 



(2.30) 


' Var[gg |H^] - aT hf k " 2 d £ - 

Since the m.g.f . from (2.27) is dependent on the signal-to-noise ratio 

D 2 only through the function H(x) , the distribution of the threshold 

2 

statistic will not depend on D when is true. 

0 V 



Simple detector 


Instead of weighting the number n of photoelectrons from each 
area dA^ with a factor £n H^) , a simple way is to collect all the 
photoelectrons ejected from a certain area of the surface without 
any weighting factor about the location. As TW >> 1, the probability 
of n photoelectrons ejected from the area A^i under hypothesis has 
been proved in Chapter I to be Poisson distributed and 

P n (T|H k ) = e" nTk / nl , (2.31) 


where 



J= 



k *= 0 or 1, (2.32) 


is the average number of photoelectrons ejected from the area A^t under 
ligand I k (x) is given by (2.9). The likelihood ratio of the simple 
detector according to (2.1) is 


A s (n) = P n ( T ! H i> / P n (T l H 0 ) 



(2.33) 


Hypothesis Hq will be chosen if A g (n) ^ an< ^ will be chosen 

otherwise, where A _ is the decision level, which can be obtained 

sU 

according to the criterion we use. Because the likelihood ratio A g (n) 
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of the simple detector is a monotone function of n, it is therefore 
equivalent to use the optimum statistic from the logarithm of A g ( n ) 


n 


(2.34) 


and hypothesis ^ will be chosen if g g > g fi0 and H Q otherwise except 
that at g = g a probability f will be assigned for choosing 
When the Bayes criterion is used the decision level g^g of the statistic 
can be determined from the likelihood-ratio decision level A^g and 


g g0 " ( * n A s0 + An T ) [iUl(1 + .- A V n T0 ) ‘ 1 " 1 


(2.35) 


where An^ - - n T0 is the average number of photoelectrons ejected 

from the area by the object light illumination alone and can be 
calculated from the expression 

Aiij, » aT X g J* y(x) d 2 x ^ (2.36) 

The optimum area A^« for the observer to adopt is the area where the 
detection probability is maximum for a fixed false-alarm probability or 
the error probability is minimum. 



Reliability and error probability of the ideal detector In the absence 
of the background light 

The performance of a detector can be characterized by the re- 
liability, Q and Q , and the error probability P as defined by (2.3) 
and (2.6). When there is no background light, there will be no photo- 
electrons ejected from the photosensitive surface under the hypothesis 
Hq. For both Bayes and Neyman-Pearson criteria the strategy is to 
choose whenever one or more than one photoelectron have been' 
observed. As TW >> 1, the joint probability for observing the data 
ji e ( n i» n 2*'* n N) ^ rom t * ie N small areas under hypothesis is given 
by (2.10). Since the probability of zero photoelectron emission 
under is 1, the likelihood ratio at n. « 0 is 

-H 

A(n=0) = e 8 , (2.37) 

where 

N s • aT X g (2.38) 

is the total average number of photoelectrons ejected from the photo- 
sensitive surface due to the object light. For the detection of the 

image when Bayes criterion is used, hypothesis H will be chosen at 
-N l 

*= 0 if e 8 > Aq and otherwise, with given by (2.2). If the 

relative costs - Cqq and are equal, the hypothesis with 

greater posterior probability P(KL j_n=0) for k *= 0, 1 Is always 
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selected where 


ro^ | n=0) = P(H k ) P(n=0 | y / P(n=0) , 

k = 0 or 1 . (2.39) 

The minimum average error probability will be 

-N 

P « min(£, (l-£) e S ) . (2.40) 

e 


In particulars when the hypotheses are equally likely 



(2.41) 


For the Neyman-Pearson criterion, the randomized strategy must be used. 
As the false-alarm probability is preassigned, hypothesis II ^ will be 
chosen with a probability when no photoelectron is emitted from the 
surface during the interval (0,T). The detection probability at this 
case becomes 


Q d = 1 - (1-Q 0 ) e ^ . 


(2.42) 


Thus the error probability can be calculated again according to 

( 2 . 6 ). 



Reliability and error probability of the ideal detector in the presence 
of the background light 

When the background light is present and Iq(x) = 1^> the opti- 
mum statistic g is expressed by (2. 18), where its m.g.f. is given by 
(2.19). It is not in general possible to evaluate the distribution, 
which is known as an infinitely divisible distribution^ . 
Helstrom^^ treated the distribution by a Gaussian approximation, 
where the p.d.f. of the statistic g is approximated by 


K, (g) - 


2ti a. 


2 

exp[-(g-g k ) / 2 o k 2 ] 


(2.43) 


with 


8 k » EEgl^] 


\ “ Vartgl^] , 


for k = 0, 1 


given in (2.22) . 

The reliability of the detector can then be expressed by 
the error-function integral. 


Q 0 = Pr[g>g 0 |H 0 ] - erfc (y Q ) # 

Q d •= Pr[g>g 0 |H 1 ] - erfc ( y 1 ) , (2.44) 
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with 


erfc (y^) 



>’k = (8 0 - 8 k } ' °k 


k ■= 0, 1 


As we have discussed in Chapter I, with the m.g.f. given by (2.19) the 
distribution of the statistic g can be approximated by the Gaussian 
function if only the condition from (1.63) is true. In other words, 

with B(x) = Jin H(x^) the condition from (1.63) will be satisfied if 

2 2 
D << 1. For large values of the signal-to-noise ratio D , the 

Gaussian approximation will not be accurate because the mth order 
cumulant of the statistic Increases as m increases. Farrell^^ 
recommended approximating the distribution of the statistic g by a 
gamma distribution function. The p.d.f. of the statistic is approxi- 
mated by ‘ 

G k (g) = [a k 1 r( V ] (a k g) k e kS (2 - 45 > 

with 


a k “ ' Var CBl\Jl 



80 that the mean and variance of the gamma random variable are matched 


with the mean and variance given by (2.22). Here TO^) is the 
gamma function. 

The reliability of the optimum detector is then approximated 

by 


p Cs > 8 0 I H 0 3 * / e 0 (z) dz » 


o 


Q d == PCg>g 0 |H 1 ] * / 0 1 (z) dz , 


(2.46) 


where 


B k (z) ■ vfc) ^ *" z 


z k " a k 8 0 , 


k - 1, 0 


On the other hand, it is reasonable to use the method of steepest 
descent as illustrated in Appendix A, where well-known distributions, 
such as Gaussian, gamma and Poisson, have been used as examples and 
good numerical agreement has been obtained, as listed in Table 3 and 
Table 4. 


The optimum statistic g given by (2.18) is a non-negative 
random variable since the function H(x) exceeds 1 for all x. The 
m.g.f, is given by (2,19). The probability that g exceeds the decision 
level under R can be written as 
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pr[g>e 0 |i^3 = 



dF k (g) 


> 


k « 0, 1 . (2.47) 


and can be approximated by the asymptotic expansion series given by 
(A12) in Appendix A, where F k (g) is the distribution function of the 
random variable g. The complex phase of the integral defined by (A7) 
under hypothesis for k = 0 or 1 is 

-1 

<f> k (s) = g Q *n h k (s) + s 

- g 0 -1 aT J H k (x) [<H(x)) _S -l] d 2 x + s , 

A 

for k - 0, 1 (2.48) 


when (2.20) is used. 

There is only one saddle point for this complex phase as 

discussed in Appendix B, and it. can be determined from the equation 

d 


ds *k (s) 


0 or 


aT I. 


^ (H(x))k 3 £n H(x) d^x 


(2.49) 


If we use the statistic 


N 


g* 


Z n 


i°l 


n il 

. In + N 

1 n i0 8 
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instead of the statistic from (2.12) we can derive the m.g.f. of g’ 
under hypothesis for k = 0, 1 


as 


h if f (s) = exp 


joT Ih jT [(H0s)) k " 8 -l] d 2 x + (8-k) N s J 


which gives us the fundamental relation for all likelihood-ratio de- 
tectors as 


hj_(s) - h^(s-l) . 


' (2.50) 


The saddle point s 1 under H 1 can be directly obtained through the 
saddle point s Q under H Q for real values by 


S 1 = 8 0 + 1 • (2.51) 

The relation from (2.51) also satisfies (2.49) for a fixed value of g Q 
for both H. and H A . Whether the saddle point s for k = 0 or 1 is 
positive or negative will depend on the following conditions 


< o if g 0 > ECglig , 


> 0 if g Q < E[g |ll k ] ? 


0 


if 8 q “ ECgi^k] > 


k ■» 0, 1 


(2.52) 
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The nth derivative of the complex phase 4> k (s) for n ^ 2 under hypo- 
thesis is 

C (s) = tit 

as 

* <-l) n gn" 1 «T 1^ f (H(x)) k S £ n n H(x) d 2 x , 

A 

for k «= 0, 1 - (2.53) 

From the relation. (2.51) we can have 

= *i” (b i ) > for n 1 . (2.54) 

The f alse-alarm probability and the detection probability can be approxi 
mated by using (A12) from Appendix A 

Q q = 1 e q(&q) “ I o^ g O^ j 

Q d = 1 " W " hV , (2-55) 

where 

1 - erfc[(-2 g Q 4 ) k (s k ))^] , for s k < 0 

E k ( g 0 ) 

erf c[ (-2 g Q 1, for 6 k > 0 
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and 


W 


ex P [ g 0 * U < S k 5] 


£2* 8 0 \ <2)(s k )]l5 j= ° 


-2 


-1 




2j 


n=0 


n 


<-s k ) 


-2 j+n 


*=0 


£,n 




V 2 M % -i 

-sign(s k ) ^ 2 j k I 24 > k <s k ) ) (g 0 ^k <S k ):> 


where A 


£,n>l2 


, sign(x) are defined in (A12) and (A10) for integers 


r £+j 


£,, n, j. If the Bayes criterion Is used, the decision level is cal- 
culated by (2.13). The saddle point can be obtained by solving (2.49), 
and <J> k (s k ) and ^^(s^) fo * k = 0, 1 can evaluated from (2.48), 
(2.53) and (2.54) where numerical integration will be required. The 

r 

probability or will then be approximated by summing the series 
given in (2.55) up to the term after which the absolute values of the 
terms either begin to increase or become insignificantly small. If the 
Neyman-Pearson criterion Is used, the value of is preassigned and the 
decision level must be hunted by iteration. Since g n is a monotone 


0 


function of the real value s^, both and Sq can be found simultaneously 
by iteration until the false-alarm probability calculated by the asymp- 
totic expansion approaches the preassigned value within a tolerable 
error such that (2.49) is satisfied. The saddle point s^ under will 
be obtained according to (2.51), and the detection probability will 


be approximated by the expansion (2.55). The numerical iteration pro- 
cedure for searching for the saddle points s^, s^,and the decision 
level is described in Appendix B. 

When the relative costs - C^q 'and - C are equal, the 
decision level for Bayes criterion is given by (2.13) with 
Aq *= £/(l~£)» The saddle points and s^ are calculated from (2,49) 
and (2.51). The average error probability is minimized and can be 
obtained after the probabilities and are approximated by the 
asymptotic expansion (2.55). The performance of the ideal detector 
will be studied with a Gaussian image and compared with the threshold 
detector and simple detector in the following paragraphs. 



Performance of the ideal detector for a Gaussian image 


In order to investigate the performance of the ideal detector 
and compare it with the other two detectors, the threshold and the 
simple detector, a Gaussian image will be postulated. The average 
intensity function on the photosensitive surface due to the object 
light given in (2.16) is described by the weighting function 


Y(&) 


1 

~ exp 

2tto 



(2.56) 


where o is the width of the image, this might be the image of a circu- 
lar nebula or, more important, of a point source whose light has 

(14) 

passed through a turbulent medium . If we define 


N Q *= 2u o 2 ^ aT (2.57) 

i 

as the average number of photoelectrons emitted from an area 
2 

A *= 2ir a of the photosensitive surface due to the background 

2 

light and with N given by (2.38), the signal- to-noise ratio D and 
s 

and function u(x) from (2,17) can be written as 


D « N /N n 

S 0 i 


for Nq i 0 


uQc) = exp 


2 u. 2 
X 1 +X 2 

2a 2 


(2.58) 
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In the absence of the background light, that is = 0, and for a 
false-alarm probability Qq preassigned at 0.2, 0.1 or 0.01, the de- 
tection probability Q d is plotted as a function of N g In Fig. 4; 
(2.42) is used to calculate Q^. 

If the background light is present, the m, g.f. of the ideal 
statistic g at WT >> 1 for the Gaussian image is given by 


N, 


.co co 


h k (s) = exp 


(1+D 2 u(£)) k [(1+D 2 uCx))" S - 1] d 2 x 


2-na -«> 


k = 0, 1 


(2.59) 


when (2.19), (2.56), (2.57), and (2.58) are used, 
Now we can change variables by letting 


: ^ ■= Ro cos 6 , 


Ro sin G 


so that ' 


u(x) « exp 


K 


The m.g.f. from (2,59) becomes 


h k (s) » exp 


\fl 

2tt o 2 0 


<1+D 2 exp(-R 2 /2)) k 


[(1+D 2 exp (-R 2 /2) )"* S -1] o 2 RdRdG 
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/* 1 r " 


*» exp 

N 0 J U+D 2 y) k ^(l+D 2 y)" S - 1 

•a 

rH 

1 

>> 


k '** 0 or 1 


(2.60) 


• 2 
— R ^/2 

where (2.60) is obtained by changing the variable y = e . The 

complex phase required in the asymptotic expansion series under 
for lc == 0, 1 as given by (2.48) can be written as 


-1 


*,.00 = g 0 - N q / (1+D 2 y) k (l+D 2 y) _S - l] y" 1 


dy + s , 


k « 0., 1 


(2.61) 


where the saddle point is obtained by solving the equation 

1 


g 0 - N 


Q J (l+D 2 y) k “ S £n(l+D 2 y) y _1 dy , 


k « 0 , 1 


(2.62) 


and and s^ satisfy (2.51). 

The nth derivative of the complex phase <f>^(s) for n ^ 2 under 
hypothesis is 

♦ k "<») = (-l) n g 0 _1 N 0 f (1+D 2 y) k_s in n (l+D 2 y) y _1 dy , 


k n 0 or 1 . 


(2.63) 



Figure 4 Detection probability Q d as a 

function of N defined in (2.38) for the 

s . • • 

ideal detector in the absence of background 
light; f alse-alarm probability « 0.01, 
0 . 1 , 0 . 2 . 




The mean and variance of the optimum statistic g from (2,22) for the 
Gaussian image under for k - 0, 1 becomes 


. E[g|\3 " 

Var[g | h^] = 



(1+D 2 y) k 



(1+D 2 y) k 


Jl - n(l+D 2 y)y"^dy , 


£n 2 (l+D 2 y)y“ 1 dy 


(2.64) 


In order to compare the three approximation methods, steepest descent, 

Guassian,and gamma, the reliability of the ideal detector has been 

calculated by the three approximate forms as given in (2.55), (2.44), 

2 

and (2.46) respectively. At typical values of n q = N s “ 5 ( D = ^ * 


the false-alarm and detection probabilities of the statistic are 
plotted as functions of the decision level in Fig. 5. It can be 
noticed that both Gaussian and gamma approximations with the mean and 
variance matched by (2.64) are least accurate in the tails of the 
distributions. For small false-alarm' probability Qq, there may be a 
serious error in the decision level gg if either one of the approxi- 
mate forms is used under the Neyman- Pear son criterion where Qq is 
preassigned. To apply the asymptotic expansion series given in (2.55), 
an iterative search method was used to determine the decision level 
and the associated saddle point Sq. Detailed description of the 
iteration is given in Appendix B. 

To investigate the performance of the optimum detector and 
later the threshold detector, the detection probability of the 
optimum detector is calculated by using the asympotic expansion series 



from <2.55) and is plotted as a function of the signal-to-noise ratio 

for typical values of Nq at 0.5 and 5 in Fig. 6 where the false- 

-3 -5 

alarm probability is preassigned at 10 and 10 . The detection 

probability is also plotted as a function of the mean number N g by 

-3 - -5 

the solid curves in Fig, 7 at Qq = 10 and in Fig. 8 at = 10 ♦ 

The detection probability at = 0, given by (2,42), is also 

plotted in Fig. 7 and Fig. 8 for the purpose of comparison. When 

the relative costs C 1Q - C Q0 and C Q1 - are equal, the minimum 

error probability of the optimum detector at £ = % can be calculated 

by using (2.6) and the asymptotic expansion series (2.55) and is 

plotted as a function of the mean number N g by the solid curves in 

Fig* 9 where the decision level g A = N is obtained according to (2.13) 

us* 

and the saddle point s Q is searched for by iteration to satisfy (2.62). 



Figure 5 Probability Q q and probability 
Q d as a function of decision level g Q for 
the ideal detector calculated by the saddle 
point » Gaussian and gamma approximations; 
N q « N g « 5, where N Q is defined in (2.57). 
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Figure 6 Probability Q, as a function 

® 2 

of the signal-to-noise ratio D defined 


in (2.58) for the ideal detector; 
N q - 0.5, 5, Q q = lO -3 , 10“ 5 . 
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Figure 7 Probability Q d as a function of 
N g for ideal, threshold and simple detectors 
Q 0 - l(f 3 at N q = 0, 0.5, 5. 




Figure 8 Probability Q d as a function of 

N for ideal, threshold and simple detectors, 
8 -5 





Figure 9 Average error probability P e in 

detecting the Gaussian image against the 

uniform background light vs. the mean number 

N for ideal and simple detectors; N„ = 0, 
s ^ 

0.01,0.5, 5. 




Performance, of the threshold detector for a Gaussian image in the 
presence of the background light 

2 

When the signal-to-noise ratio D is small and the threshold 
statistic g n from (2.26) is used the m.g.f. given by (2.27) for a 
Gaussian image becomes 


h ek (s) = 



2 , svk 


k - 0 or 1 . 


(2.65) 


Again by changing variables, we have 


h ek (s) exp 




*2tt r » 


2iro 0 0 


(l+D 2 exp(-R 2 /2)) k 


(exp (-s exp (-R 2 /2) )-l) a 2 RdRd0 


exp 


M 


(1+D 2 y) k (exp(-sy)-l) y ^dy 


k ** 0 or 1 . 


( 2 . 66 ) 


The mean and variance of the threshold statistic g Q under for 
k *= 0, 1 are 


82 
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BCgelHfc] - N 0 J Cl+D 2 y) k dy - N 0 (l+kD 2 /2), 


Var[ g() |H k ] = N 0 J (1+D 2 y) k ydy - N Q (^kD 2 /3) , 


k *= 0, 1 . 


The complex phase defined by (A7) under for k = 0, 1 is 


* n h m,( s > + s 


“ 8q 0 "^ (l+D 2 y) k [exp(-sy)-l]y 1 dy +s 


from which the saddle point s is determined by the equation 


£*ek (s) = 0 




(1+D 2 y) k exp(-sy) dy 


^ Nq | 1-exp (-s)+kD 2 [ s ^-(1+s ^)exp(-s)]| 


k * 0, 1 . 


Since the relation of the m.g.f.'s.by (2.51) does not hold for the 
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statistic g ft . the saddle point s under hypothesis 1L for both k = 0, 1 
must be calculated from (2. 69), which has only one root of real value 
as discussed in Appendix B. The nth derivatives of <f>^(s) f° r n ^ 2 
are given by 


° (“1) 8qq ( 1+d y) y n ^ exp(-sy) dy 


- l-V n Beo ' 1 N 0 { f n-l (s) + *> 2 f n (s) } 


lc - 0, 1 


(2.70) 


where 


f n (. s > 


y n exp(-sy) dy 


• -n-1 "1 / \ 

*= n; 8 - s expi*-s; 


[n! |(n-j)I] 8 - J 


j=0 


(2.71) 


Again, with the mean number Nq and the signal-to-noise ratio D given, 
the reliability of the 'statistic can be approximated by the asymptotic 
expansion as given in (2.55). For eachr decision level the saddle 
points Sq,s^ can be solved for from (2.69). The false-alarm proba- 


bility and the detector probability are plotted as a function of g 

2 

in Fig. 10 at Nq = 5 and D =0.1, 1, 2, 3, 4, and 6. Since the 


60 


C£ 



distribution of the threshold statistic g n is independent of the 
* 0 

signal strength under hypothesis H^, the advantage of the threshold 

detector is that for each preassigned the decision level is 

2 

fixed for all values of D as shown by Fig. 10. The detection proba- 
bility is also plotted as a function of the mean number N g by the 

dashed curves in Fig. 7 and Fig. 8 for typical values of Nq at 
-3 -5 

Qq * 10 and = 10 . The detection probability of the 

threshold detector approaches the curves of the optimum detector when 

2 

D is very small; however, the optimum detector always has higher 

detection probability. To calculate the minimum error probability 

P one has to solve the decision level g.* such that the ratio of 
e GO 

the probabilities for g Q - under and is equal to £/(!-£). 

We have not carried out the calculation. 



Figure 10 Probability Q q and probability 
vs. decision level g^. for threshold 
detector; N Q = 0.5* D 2 = 0.1, 1, 2, 3, 4, 6. 




Performance of the simple detector for a Gaussian image in the 


presence of the background light 

For the simple detector, the optimum statistic from the like- 
lihood ratio is the total number n of the photoelectrons emitted from 
the optimum area as given by (2.34). If the radius of the area 
A^' is given by aR o , the average number n^ of the photoelectrons 
emitted from A^ r under hypothesis for the Gaussian image according 
to (2,32) becomes 



* RdRdO 


*= air a 2 R 2 I,T + ka IT (l - exp(-R 2 /2)) 

O D S O 

- N Q [p + ld) 2 (l - e“ P )] 

k = 0, 1 , (2.72) 


12 2 2 
where p = — R and A, ’ = tt a R 
2 o 1 o 

Since the statistic g is a discrete random variable, when 

s 

the Ney man-Pear son criterion is used, randomization must be applied. 
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That is, the reliability of the simple detector according to (2.4) 
and (2,5) can be written as 


‘0 


‘TO , , ^ r - 8 s0 “ n T0 . f 

e / nl + f n mrt e / g g0 . 


n>g 


n -n„ 
‘TO 


TO 


sO 


Q d - 


n>g 


- n " n Tl - 

n T1 e / n! + f n, 


g s0 ~ n Tl . , 

T1 6 1 8 s0 • 


sO 


(2.73) 


where for preassigned the decision level g ^ is the smallest 
integer such that 


’sO 


^0 , n ; 


(n T0 ) / n! > 1 - Q, 


n=0 


(2.74) 


f is the probability to choose when n * g g g and can be determined 
from 


sO 



n=0 


r nT0 (n T0 ) n / n.f Q 0 - 1 


- "8en n < 


( n T0 ^ 


sO TO 
e g 


sO ’ » 


(2.75) 


where n TQ> n T1 are given by (2.72) for k *= 0, 1. Hypothesis H 1 will 
be then chosen if n > g gQ and H Q will be chosen if n < g^. As the 
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false-alarm probabilities Qq are preassigned at 10 and 10 , the 

detection probabilities of the simple detector are calculated 

from (2.73) at - 0.5 and 5 and are plotted as a function of the 

mean value N in Fig. 7 and Fig. 8. For each point on the curves, 
s 

the radius R q was varied to yield maximum detection probability. 

, . When the Bayes criterion is used, with given by (2.2), 
the decision level from (2.35) becomes 

g s0 = [*nA s0 +N 0 D 2 (l-e" P )] [£n(l+D 2 p _1 (l-e" P ))] _1 . 

(2.76) 


Hypothesis will be chosen if the number n > or will be 
chosen otherwise. The reliability of the detector can be evaluated 
according to (2.73), except that we let f = 0 because the probability 
for g^Q to be an Integer is negligible. When the relative costs are 
equal, the average error probability given by (2.6) is minimum and 
can be written as 



At £ » ht g g Q is calculated from (2.76) with = l,and the error 
probabilities are calculated and plotted as a function of mean 



number difference AN = N g at N Q = 0, 0.01, 0.5 and 5 by the dashed curves 
in Fig. 9. For each point on these curves the value R^ was varied to 
yield maximum P £ . 

When the false-alarm probability Q Q is preassigned, the 

decision level g determined from (2,74) is a mono tonically increasing 
su 

function of R Q . The optimum radius R 0 at which the detection proba- 
bility yields a maximum can easily be searched by the digital 
computer. The values of the optimum radius R Q at different values 
of N g are listed in Table 1 at typical values of Nq - 0.5, 5. The 
maxima are quite flat and the radius of the observation area is 
not critical. For the calculation of the average error probability 
when the decision level g^ is determined from (2.76), g g Q is also a 
mono tonically increasing function of R p . However, both Qq and 
are varied by changing g^^and their sensitivities toward the change 
are different. We found that there are several minima in a certain 
small region as we vary the radius R Q ,and we have picked the smallest 
one for the plots in Fig. 9, 



Discussion 


It is important in detection theory to calculate the distri- 
bution of the statistic as accurately as possible so that the statis- 

\ 

tical performances of the detectors can be investigated. The statistic 
given by (2.18) or (2.26) is the sum of N independent random variables 
and is very common in detection and estimation theory. Since the 
exact distribution of the statistic cannot be obtained, the Gaussian 
approximation has been largely used for the statistical evaluations 
in the past. On the other hand, the method of steepest descent has 
been proved mathematically to provide approximations of an accuracy 
that is often high; usually the error of the approximation is of the 
order of the first term neglected in the asymptotic expansion series. 
Furthermore, the asymptotic expansion series expressed by (2.55) 
gives the exact expression of a Gaussian distribution and a fairly 
good numerical agreement with the other two well-known distributions, 
the exponential and Poisson distributions, as discussed in Appendix A. 
It is therefore appropriate for us to use the method of steepest 
descent to approximate the tail distribution of the ideal and 
threshold detectors in order to investigate their performances, 
although more numerical calculations may be required such as searching 
for saddle point. With the availability of the digital computer at 
the present time, the asymptotic-expansion approximation should be 
very useful for solving many problems in detection and estimation 
theory. 
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Each of the detection statistics we have discussed is the 


sum of every ejected number of photoelectrons weighted by a factor 
as given by (2.18), (2.26), and (2.34). In order to design a better 
detector, we must make the false-alarm probability of the detector 
small and its detection probability as large as possible, or, 
equivalently, make the magnitude of the statistic as small under 
hypothesis Hq and as large under hypothesis as we can. From 
Figs. 7, 8 and 9 we can see that the ideal and threshold detectors, 
which utilize the information about the shape of image, are not much 
better than the simple detector when the optimum observation area 
of the simple detector is used to yield maximum detection probability 
or minimum error probability. In other words, to register the loca- 
tions of photoelectrons does not help much to improve the detection 
of the image. This is because when the background light is assumed 
to be uniform, the effectiveness of reducing the magnitude of the 
otatistic on the average under Hq or increasing the magnitude of the 

statistic on the average under is limited, especially when the 

2 

signal-to-noise ratio D is low. On the other hand, by varying the 
observation area, we can optimize the mean numbers of photoelectrons 
under and effectively and reduce the error probability or 
improve the reliability of the simple detector. 
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TABLE 1 . 

Optimum Radius for Simple Counter 


N 

s 

R o % " 10_3) 

r o (Q 0 - io- 5 ) 

1 

1.31 

1.37 

2 

1.72 

1.68 

4 

1.72 

1.68 

8 

2.10 

1.68 

12 

2.10 

1.98 

20 

2.10 

1.98 

24 

2.46 

2.28 


N & 5 
«0 D 

N 

C 

R ° (Q o = 10_3) 

R o (Q 0 - 10“ 5 ) 

1 

1.36 

1.37 

2 

1,44 

1.44 

4 

1.60 

1.58 

8 

1.60 

1.58 

12 

1.75 

1.65 

20 

1.82 

1.78 

24 

1.82 

1.78 
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Chapter III Simultaneous detection and estimation of the mean 
intensity of an optical image 

The detection of the image on a photosensitive surface 
described in Chapter II is based on the knowledge about the intensity 
and location of the light source. In practice, however, the estima- 
tion of the parameters such as the intensity 1^ and the center loca- 
tion .Xq of the image may be needed as well as the detection. The 
observer must estimate these parameters as best he can on the basis 
of the observed set of data jn. In this chapter we will assume that 
if the object image is detected, the location of the image is known, 
but its intensity must be simultaneously estimated. One seeks a 

A 

strategy to estimate the Intensity, I g “ I s Cli) » which is a function 

of the data ti - (n^,^ . * . n^) . Two most important strategies, 

( 12 ) 

Bayes and maximum likelihood * , will be discussed. A quadratic 

cost function will be used to derive the Bayes estimate. When the 
signal is not too weak, the Bayes estimate is approximately equiva- 
lent to the maximum likelihood estimate. An almost optimum estimate 
is therefore proposed and its statistical properties will be studied. 
The expectation of the biased estimate will be evaluated and plotted 
as a function of relative intensity at different observation times 
or total numbers N of small areas with 'a truncated Gaussian or 
parabolic image. 
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Simultaneous detection and estimation for a single parameter 


As we have discussed in connection with the binary hypotheses 
tests in Chapter II, when a set of data Jb - . . . b^} is 

obtained, the' observer will choose the hypothesis between the binary 
hypotheses and Since the signal received under hypothesis 

now depends on a certain unknown parameter 6, the observer must seek 
a strategy based on the N measurements of b_ so that when hypothesis 

( 3 ) 

is chosen, the parameter 0 will also be estimated . If, however 
Hq is chosen, it implies that there is only background noise; there- 
fore the parameter 0 does not exist or is equal to zero. The condi- 
tional p.d.f. P(b|e) is assumed to be given in general. If the cost 
functions C(§,6), which are the costs for choosing hypothesis Hq or 

A A 

and making estimate 0 " 0 or 0(b) for given true parameter 0, are 
given, and if the prior p.d.f. z(0) of the parameter 0 and the condi- 


> 


tional p.d.f. Y o (0|,b) of the estimate 0 for given dataware known, 

( 4 ) 

then the Bayes strategy can be used. The average risk per experi- 
ment for simultaneous detection and estimation of the parameter is 
given by 


C 




d\ o(0) Y o (0 |b) P(b|0) C[0,6] 


7 


( 3 . 1 ) 


where ft, A and Z are the spaces for parameters 0, 0 and _b respectively. 
The Bayes strategy is to properly choose the function Y o (0[b) 
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so that the average risk C can be minimized. Let us define 


Y o (0|b) «= ^(H 0 |b) 6(0) + n(e|b) , 


(3.2) 


where 6(9) is the Dirac delta function. ^(H^J^) is the conditional 
probability for the system to choose H^. Jl(o|b) is the conditional 

A 

p.d.f. for estimate 6 when H is chosen with observed data b^. Further- 
more, we have 



iKeJb) de = 


^(HiJb) j 


(3.3) 


where tfj(H^lb) is the conditional probability for the system to choose 
with given data _b and ^(H-Jjb) + ^(H^ljb) = 1. We also let 


0 (e) = c 6 (6) + (1-0 z(0) , 


(3.4) 


where £, l-£ are the prior probabilities for hypotheses and as 
discussed in Chapter XI. z(0) is the prior p.d.f. of parameter 0 
under H^. 

By substituting (3.2), (3.3) and (3,4) into (3.1) we can 
write the average risk as 



*(H 0 |jyc 


00 


+ 


/ 


d 0 JI(e|b) 


c i 0 (e) 


cy 
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de z(e) [^(H 0 |(b)) c ol (0) + 

c n(^ ,e) ] p | (3 * 5) 

where P Q (b) - P(b|H 0 ), 

is the cost of choosing when is true, 

C 01 (G) is the cost of choosing H Q when is true with 

parameter 0, 

^ * • 

C 1O (0) is the cost of choosing with estimate G when 

is true, , 

A A 

C^(0,6) is the cost of choosing with estimate G when 
is tru,e with parameter 0. 

Now since 

P(b|6) z(G) = P(e|b,) P(b) , (3.6) 

where P(0[b) is the posterior p.d.f. and 

P(b|0) z(G) d 0 , (3.7) 

we can also write the average risk C in terms of the conditional risks 





as 
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d N b P(b) 


[ C HqI fa + J de n(D|b) c(e |b) J (3.8) 


where 


C H 0 |b - *<“oly [? c 00 P 0^ 1 


+ (1-0 J de c Q1 (e) P(e|b)j 


(3.9) 


is the conditional risk when is chosen for given b. and 


c( e |b) = C C 1O (0) P 0 (b) / P(b) 


+ (1-?) de c u (e,e) p(s|b) (3.10) 


is the conditional risk when is chosen with 6 for given b^ow the 

p.d.f. P(b) and conditional p.d.f. H(9j^b)' are both positive over the 

space I of the outcome b> and the conditional risk C . is not a 

H_J b 

- _ 0 ~ 
function of estimate 0, the average risk C is minimized with respect 

to 0 by making the conditional risk C(6 |b) as small as possible for 

every set of data _b . Now • we choose the p.d.f. H(e|.b) as 


n(e|b) - iKHjb) 6(6 - 6(b)) } (3.11) 

* A 

where 0(b) is the optimum estimate 0 obtained from minimizing the 
conditional risk C(6|.b) and can be determined from the following 



equation 


— C(e|b) = 0 or 

96 


K P 0 ( fe) jr C 1O (0) + (1-0 P(b) J dO P(e|b) 


■4 c (6*0) • = 0 . 
ae 11 


(3.12) 


A A 


Thus the minimum average risk from (3.8) at 6 = 6(h) becomes 


''min 


■ A. jfi-KHjb)] [f;c 00 P 0 (b) 

+ ( 1-0 p(b) J ae c 01 (e) P(e|b)j 

+ [? c 10 (e(b)) P Q (b) 

.+ (1-0 p(b) / de c 11 (e(b),e) P(ejb)]j (3.13) 


If we now define 


A(y = c c 00 p Q (b) + d-o p< &>f a de c oi (e) p(0 li ) 


B(b,e(b)) = 5 C 10 (8(b)) P 0 (b) + (l-o P(b) 



then (3.13) can be written as 


'min 


/ d N b |A(b) + iKHjb) [B(b,0(b)) - A(b)]j . 


( 3 . 15 ) 


As the cost functions can be chosen so that both functions A(b) and 
B(b, 0(b)) are positive. Then we can further minimize the average risk 
from (3.15) so that 


H\\$) 


if B(b,0(b)) < A(b) , 


or 


♦ (HjJb) = 0 

f-(H A |b) « 1 


otherwise 


( 3 . 16 ) 


This is equivalent to defining a cost likelihood ratio A £ where 


(l-S) de[c ol (8)-c 11 (0(b),e)] p(e|b) 


« F o<& [c 10 (e(b))-c 00 ] 


( 3 . 17 ) 


and a decision level A^ - 1 so that hypothesis will be chosen if 
A c > A cQ and estimate 0(b) will be issued; otherwise, will be 
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choaen. 

When there is no information about the cost functions, the 
prior probabilities of the hypotheses, or the prior p.d.f. z(0) of 
^the parameter, the maximum-likelihood strategy can be used. The 

A 

estimate 0 will be determined from maximizing the posterior p.d.f. 

P(8 fjb), which is expressed in (3.6). Since P(b) is independent of 6, 
it is equivalent to maximizing the product of P(bJo) and z(0). With 
very little knowledge about z(0), we must assume that the prior p.d.f 
z(0) is very broad, so that z(0) will not affect the decision and 

A. 

estimate we have made. This implies. that the estimate 0 is simply 
obtained by maximizing the joint conditional p.d.f. P(b|o) or 
determined from the following equation 


30 


P(b|G) = 0 . 


(3.18) 


^ . 

The estimate 6 determined from (3,18) will be a function of the data 

k- ' 

When the data £ are a set of discrete random variables, the 

function P(bj0) will be the conditional probability function instead 

of conditional p.d.f. All the equations we have discussed in this 

chapter are still valid except that we have to change the integrals 

/ d^b into summations E . For the detection of the image and 
J E jb 

estimation of its intensity of a light source, both Bayes strategy 
and maximum likelihood strategy will be discussed in the following 
paragraphs. 
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The operation of simultaneous detection and estimation of 
the signal and its parameter 0 can be described by the following 
block diagram. 


A * 1 



Figure 11 Block diagram for simultaneous 
detection and estimation of a single parameter 
6 from the signal. 






/ 

Bayes strategy for detecting the image and estimating Its intensit y 
of a point source with a quadratic cost function 


When the photosensitive surface is divided into N small areas 
dA, the data ji are the numbers of photoelectrons ejected from the N 
6mall areas dA^ = dA, i = 1, 2 . . . N ; n is a set of N discrete 
random variables. As TW >> 1, they are Poisson distributed and 
statistically independent. The joint conditional probability under 
hypothesis can be written as 

P(n|l s =0) = P 0 (n) 

where 


N 


17 


.1 -V 




e * / n r * , (3.19) 


M = aT dA 


( 3 . 20 ) 


is the average number of photoelectrons ejected from each area dA 
due to the background light illumination. The joint conditional 
probability under hypothesis can be written as 


N 


pfely -T (p +i s v 1 


e / n. I 

1 » 


(3.21) 


where 


q ± *= aTYi dA , Y i -Y(x, i ) 


(3.22 ) 
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JLU 


with yOs^) as the weighting function given by (2.16). We let the 
cost functions described in (3.5) be 


C-io^g) c 10 , 


C U <I S ,I S > ” VW » 


(3.23) 


where C 00> C Q1> C 1Q , C Q are all given constants. 

The Bayes estimate I (n) can be determined from (3.12) 


i,fe) “ / I s Hl s |n) dl s , 


0 


(3.24) 


which is the conditional expectation^ of the intensity 1^. The 
posterior p.d.f. P(I |jn) can be expressed by using (3,6) and (3.7) 
as . 


P(lJn) = 


zd s ) p<a|r s ) 


Z (i s ) p<a|i s ) di s 


(3.25) 


'0 


for positive values I . The denominator of (3.25) can be written as 

O 


l 


2 (I C ) P(n|l s ) dig 


N 


f TT n i 

/ dl z(I ) I | (w+I q ) e S / n J . (3.26) 
0 b 5 i«l b 



1 


When the p.d.f. z(I ) is given, the Bayes estimate from (3.24) can 

i> 

be evaluated. ' 

We now assume that the p.d.f. z(I ) is a gamma distributed 

D 

probability function 


‘ C B B-l “ CI S 

Z( V ~ r(B) I s e u(I s ) 


with B, C real and positive constants. U(x) is the step function. 
If the object light is very much weaker than the background light, 
I << I, , (3.26) can be calculated approximately as 

D D 


Z(I C ) P(n|l ) dl 


S S 


dT T 8-1 P ~ CIs 

dI s r(B) x s 


’ n p 1 { i+n± \ y± ) / n± ' 

N 

7T e" 11 p 1 ^1 + a lYl cf^ ) / V ■ 


The Bayes estimate from (3.24) becomes 


I s 0l) 


-TT 


e -8 p 1 #[l+n jYj (B+l)/CI u ] / n ’ 


C v i i 


b J ' “i 


i D l n 

e“ y » 1 (l+n i Y i B/CI b ) / n ± i 



□ 


B 

C 


N 



i=l 




(3.29) 


where all the values n^, 1^, B, C are known. If the object light 

is so strong that we can assume that there are areas dA which are 
so close to the image center and such that >:> 1^ > anc ^ m Q areas 

which are far away from the image center and -y^I << l b (m Q + = N 

is the total number of areas into which we have divided the image) , 
then (3.26) can be calculated approximately 


z( y p <eiy di s 


m. 


- X > e" m ° v TT ^ v 1 


r(B) 


i=i 


B-l + M 1 »(CH-M )I C 

e q dI s 


(3.30) 


where 

i 



m o 



m l 



is the total number of photoelectrons ejected from the areas far 


from the image center. 
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Is the total number of photoelectrons ejected from the m^ areas close 
to the. image center, where 


B-l + M, -(C+M )I_ 

1 e q b dl. 


r (b + h^) 

(C + M q ) B+M l 


(3.31) 


The Bayes estimate X (n) according to (3.24) can be approximated as 


I s (n) * 


r(B+l + M x ) B+M x 

(C+M ) r(B+M.) = C+M 

q i q 


(3.32) 


When M^ » B we have 


m. 


I (n) - K n 

i> b i=l 


(3.33) 


with K, ^ « C + M . 

b q 



Maximum- likelihood strategy and the almost optimum intensity estimate 


When there is no information about the cost function, the 
prior probabilities of the hypotheses, or the prior p.d.f. z(I g ) of 
the intensity, the maximum- likelihood strategy will be used and the 

- A 

estimate I^(n) will be determined from (3.18); it satisfies the 
following equation 


N 


n i q i 


i~! i s Ca) + p 



(3. 34) 


When the object light is so much weaker than the background light 

A 

that I << I, , the estimate I (n) can be determined approximately 
b D b 

from 


or 

If the object light is so strong that there are m^ areas close to the 
image center and (n) >> P» and areas away from the image 

A 

center and q_^Ig(n) << \x so that they contribute little to the sum, 

A 

then we approximate the estimate I (n) from (3.34) by 

I c (n) — K n (3.36) 

S i-1 1 * 


N 


N 


i*=l i-1 

N 


n i q i y "* 1 ^ “ q i ^ 


N 


I c (n) = P 2 (n.p ^l) q. / n q 2 

b i«l 1 .i-1 


(3.35) 


110 



Ill 


r 


where 



The estimate I (n) we have just obtained resembles the approximate 

O 

form of the Bayes estimate given by (3.33) when the object light is 
not too weak. In general we may not have any information about the 
cost functions and the prior p.d.f. z(I ). We therefore propose an 

u 


almost optimum estimate I (n) as 

Sa 


N 


^ ■ K 


(n ± - e) 


- e) f 


(3.37) 


where U(x) =1 for x >0 and 0 for x < 0, K is defined in (3.36) . 0 is 

the threshold level which can be determined from a preassigned 
false-alarm probability Q q such that 6 is the smallest integer that 
satisfies the following equation 


0 

N u 11 e" ,J /. n! > N - Q (3.38) 

n c 0 


In other words, the number of photoelectrons ejected from each 
area dA^, i = 1, 2 . . . N, will be compared with the decision level 
0; if > 0, we will consider that some of the photoelectrons 
ejected from the ith area of the surface are due to the object 
illumination. We therefore will keep the term. If ^ < 0 we 
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assume that the number of photoelectrons ejected from the surface 
are due only to the background illumination; thus we will discard 
the term. When all the numbers n^, i - 1, 2 . . . N are less than 
6, the hypothesis will be chosen. Otherwise, the hypothesis 

At 

will be chosen, and the intensity estimate (n) is calculated 
according to (3.37). • „ 




Performance of the almost optimum intensity estimate for a 


truncated Gaussian or a parabolic image 


The estimate I^Cn) we have discussed is a function of the N 
measurements n. Since the data n are a set of random variables, no 

A 

two experiments will yield the same value of the estimate I (n) even 

S 

though the true value of the parameter I is the same in both. The 

s 

A 

most one can hope for is that the estimate I (n) will be close to 

u 

the true value I in the sense of "on the average". 

s 

A 

The mean and variance of the estimate I we have just 

Sa 

discussed can be derived as follows ' ■ 


E[I ] = E K 

S3 


(n_.~6) uC^-e) 


K E[(n -8) u(n,~e)] 

i=l 1 1 


n ± - 6 - 


(n -G) P (n |l ) 
n.=0 1 . s 


Var[l ] *= Var k (n -6) u(n ~8) 

Sa i=1 1 i 


.«E 


VarfCn^e) uC^-B)] 


2 y u - 

K JLj { n + n 


— 2 — 2 

± + n ± - 2n ± Q + 0 
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e 


where 


v 0 


n i ■ 6 - 


(n^e) P(n A |l s ) 

e ~»2 

(n^e) PC^Ilg) 


“i"° 


P(n i |l s ) = n i. i e 1 1 n i : . 


(3.39) 


n i = y + 9 1 1 s 


To investigate the statistical performance of the estimate as 
discussed in (3.37), a truncated Gaussian image will be postulated. 
The image surface with radius R q is divided into M small rings as 
shown in Fig. 12. 

The average number of photoelectrons ejected from each ring 
by the background light illumination is assumed to be evenly distri- 
buted and symmetrical with respect to the image center. The area 
dR^ of the ith ring cari be calculated as 


>iAp 


2ir 


dR, 


(i-l)Ap 0 


pdp d<f) 


ir(2i - 1) Ap « (2i - 1) A 


t 


(3. AO) 
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where - uA p 2 , ( Ap == p^ - f° r 1 - 1, 2 . . . H. Here 

p 2 « x ± 2 + x 2 2 with x ± = p cos <f>\ x 2 = p sin f as the rectangular 

coordinates used in Fig. 3. 

The total number of rings is 


M «= R /AP . 
o 


(3.41) 


If ‘we further divide each ith ring into a number (2i-l) of small 

equal areas dA, such that dA. = A , the total number of small equal 
’ i i o 

areas dA_^ from the whole image is 

M 

N - (2i-l) = M 2 . <3.42) 

i“l 



The distribution of the mean number of photoelectrons ejected from 
the image surface by the object light is described by the weighting 
function yQs^) as given in (2.16) and (2.56) for a Gaussian image, or in 
polar coordinates we can write 


r(p) = B Q e 


-P 2 /2o 2 


(3.43) 


where is a constant and a is the width of the Gaussian function. 
The average number of photoelectrons emitted from each small area 
dA^ (*=-A ) is assumed to be evenly distributed and is proportional 
to the intensity 1^. We calculate the coefficient 

/ 2it /»i Ap • ' 

/ y(p) pdp d<f> 

u •'(i-l)Ap 


aTB Q 2Tio 


r (i-l) 2 R A 2 /2N _ -i 2 R A 2 /2N 


(3.44) 


where 


q i (2i-l) 


(3.45) 


with q. defined in (3,22), R. * R /a. R is the radius of the 
1 a o o 

truncated image under observation, i is the index (i “ 1, 2 . , , M) 


Because of the presence of the background light, one would 



like to investigate the estimate I ga with respect to the true inten- 
sity I in terms of the signal-to-noise ratio. In other words one 
S 

could assume that the energy of the background light falling on the 
image surface within the radius R q of the truncated image is equi- 
valent to that from the object light with intensity 



(3.46) 


Since the total average number of photoelectrons ejected from the 
.surface of the image is proportional to the incident energy , the 
intensity of the point source can be described by 



d 2 1 


SO 


(3.47) 


where d 2 is called the signal-to~noise ratio for the truncated 
Gaussian image. 

The mean and variance of the estimate according to (3.37) 
can be written as 


M 


E[I ] = K 

ba i-1 


M 


(2i-l) 


Sa' 


'n ± - 6 - L-J 0^-e) 

v° 


- . - 2 


Var[l c J *= K Z-/ (2i-l) { n ± + n ± - + 6 


i-1 
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n ^*=0 


0 : 
(n -6) 2 P X C n i ) - n - 0 - X/ (iy-6) P^n^l 

L 1 n.-=0 J 


(3.48) 


- n i ~ n i 

where “ n j e / with n^ =? u + q^*I /(2i-l) and 

p and q_^ 1 are given in (3.20) and (3.45). 

A 

We also define the error of the estimate I by 

O cl 


• Err c < Var I sa ) 1 E[l Sa ] 


(3.49) 


as the ratio of the standard deviation to the mean value of the 

A 

estimate I given by (3.48). 

S3 

A 

To investigate the estimate I in terms of various parameters 
we assume the background light intensity corresponding to an object 
light with unit intensity 1^ = 1. The average number of photo- 
electrons ejected from each area dA due to the background light is 

4 

assumed to be 0.1 for an observation time T and coefficient a (i.e., 

JJ ™ aT I^A o 84 0.1). The false-alarm probability is preassigned at 
0.01, Where the parameters Bq and o are set at l,the expected value 

A 

of the estimate I„ is calculated at various ratios R. = 0.5, 1, 2 
and 3 and is plotted as a function of the relative intensity l c 
defined in (3.47) in Fig. 13. As the size of the observation area 

A. 

changes, the estimate I will also be affected. In Fig. 14, the 

S3 

A 

estimate I ■ is plotted as a function of the relative intensity I 

S3 S 



Figure 13 Mean intensity estimate 1^ vs. 

relative intensity I for truncated Gaussian 
-2 ^ 

image; Q q « 10 , the number of rings is 100. 

Mean background counts y = 0,1 at s a = 1 

for time T. Curves are indexed by parameter 

R. defined in (3.44). 

A 





Figure 14 Mean estimate I vs. relative 

ba 

intensity I c for truncated Gaussian image; 
-2 b 

Qq = 10 , y = 0,1 at = 1 for 100 rings 

for time T. Curves are indexed by the 
parameter M defined in (3.41). 




Figure 15 Mean estimate I vs. relative 

* 03 . 

intensity I g for truncated Gaussian image; 
Q q *= 10~ 2 , M «= 100, p = 6.1 at R A = 1 for 
time T. Curves are indexed by observation 


intervals T. 
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when the observation area is fixed but divided into three different 

A 

numbers of rings, 25, 50 and 100. In Fig. 15 the estimate I ga is also 
plotted as a function of I with three different observation times 
T, 10T and 100T. 

Observations also have been made by assuming the image as a 
parabolic function. That is (3.43) replaced by 

Y '(y) - B’(R 0 2 -P 2 ) 

*= 0 


Ip! < R „ f 

|p| > R 0 , 

(3.50) 



The mean, variance, and error can be calculated by using (3.48) and 
(3.49). At B 1 = 1 , = 1 we have plotted the expected value of 

A 

the estimate I as a function of I at three different values of 
S a S 

parameters M arid T in Fig. 16 and Fig, 17 with Q. = 0.01. The error 


of the estimate 1^ for the parabolic image is also plotted as a 



function of the average background photoelectrons p at fixed 
• Ig *= 10 and 20 in Fig. 18, where the error for p « 0.1 is 9.24% 
at I = 10 or 3.15% at I « 20 with the image surface divided into 
100 rings. The average number p is then varied either by changing 
the size of each dA or the observation time T« 



Figure 16 
intensity 
Vi = 0.1 at 
indexed by 


Mean estimate l n vs. relative 

2 

g for parabolic image; = 10 
R 0 - 1 for time T. Curves are 
parameter M. 




Figure 17 
intensity 
V = 0.1 at 
indexed by 


Mean estimate vs. relative 
s for parabolic image: Q Q = 10 
R 0 = 1 for time T. Curves are 
observation intervals T. 





Figure 18 Error vs. mean counts y 

-2 

for parabolic image; Qq = 10 . Curves 

are indexed by parameters Ig defined in 
(3.47) and M. 





Discussion 


A statistical model for simultaneous detection and estimation 
of a stochastic image is proposed. When the prior p.d.f. of the 
source intensity and the prior probability £ are known and the cost 
functions are available* the Bayes strategy leads to the optimum 
operation. If there is no information about the prior probability 
and the cost functions , the maximum- likelihood strategy can be used. 

A 

Upon examining the estimate I for both strategies, it is interesting 

A 

to note that the Bayes estimate Ig from (3.33) for. a quadratic cost 
function is approximately the same as the maximum-likelihood esti- 
mate from (3.36) when the signal is not very weak. 

For a Gaussian image, the bias and linearity will be improved 
when is increased; however, after is larger than 2, very little 
improvement will be gained by increasing farther. In order to 

reduce the t>ias of the estimate, it is always wise to use longer 
* 

observation time, to divide the observed image into a smaller number 
M of rings, or to use larger observation image area. In addition, 

' A 

a suitable constant should be added into the estimate Ig a to compen- 
sate for the bias. The observed image has an effective range of 
between 1 and 2 for the Gaussian truncated image. For the image of 
parabolic shape, one should always use the whole image area. As 
long as the observation area on the image surface is properly defined, 
the shape of the function •y(p) has no significant effect on the 
quality of the estimate. To reduce the estimation error, increasing 



the observation time is more effective than decreasing the total 


number M of rings . as we can see from Fig. 18. 

Vlhen the false-alarm probability is preassigned, randomi- 
zation can be used and the probability f for choosing when - 0 
can be calculated from the difference in (3.38) of 



The estimate I„ from (3.37) can be modified as 
Sa 


Sa - K 


N 


i=l 


n. U(n. - 6 - 0.5) + f6m 0 - C | (3.52) 


hiq is the total number of the areas where n_^ = 0 . C is some constant. 
Since the bias of the estimate approaches a constant as the intensity 
becomes larger ? as we can see from Fig. 15 or Fig. 17, a proper con- 
stant C can be found to compensate for the bias. 



Footnotes 
Chapter III 

1. Helstrom [10], Chapter VIII. 

2. Van Trees [20], Chapter II. 

3. Middleton [9]. 

4. Middleton [26], Chapter I. 

5. Helstrom [10], Chapter VIII, eq. (1.11). 
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Chapter IV Detectors for resolving two point sources in optical 
communication by photon-counting techniques 

The techniques we have discussed of counting the photoelectrons 
from the photosensitive surface for the detection and estimation of 
the object light intensity have great usefulness in binary optical 
communication. Since those devices, such as photomultipliers, image 
tubes, and image orthicons are well developed in commercial industry, 
photon-counting techniques, combined with adequate digital computer 
facilities can be used for many purposes. In this chapter we will 
further discuss some detectors used for resolving the point sources in 
optical communication by photon-counting techniques. The ideal 
detector described in Chapter II can be used for distinguishing two 
light sources of equal radiant power from a single source with twice 
the radiant power or for deciding which one of the two light sources 
is present. In the absence of background light, the performance of 
the ideal detector for deciding which one of the two light sources is 
present will be compared with two other receivers, the simple detector 
and the counting comparator. The intensities of the optical images 
due to the point sources will be postulated to be Gaussian distributed 
spatially. As uniform background light also passes through the 
aperture, the error probabilities of the simple detector and the 
counting comparator will be calculated with a finite observation area. 
When the point sources are mislocated, the effects on the ideal 
detector and the counting comparator will be discussed, and their 
average error probabilities will be calculated. 
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Ideal detector for resolving two light sources from the one light 


source 

Uhen two point sources with equal radiant power are very 
close to each other, they may often appear to the observer as one 
source with twice the radiant power, located midway between them. 

The criterion given by Rayleigh for two point sources to be resolved 
is that the peak illuminance of the diffraction pattern of one source 
will not fall closer than the first minimum of the diffraction pattern 
of the other. In this section, we will construct the optimum statistic 
for the resolution of the two point sources from the standpoint of a 
classical hypothesis test^. The observer will choose between two 
hypotheses, (H^) two point sources whose images have equal intensities 
are present at ^x and -x in the image plane, and (Hq) one source with 
twice the power is located at the origin. The optimum statistic g 
described by (2.18) in Chapter II, based on observing the set of n 
photoelectrons, can still be applied here. The intensity I^(x) given 
by (2.15) can now be written as 

I k^i > = I b + I S Y kfei ) ’ for k = °> 1 - <4,1) 

where the weighting functions are given by 

= I [y(x 11 " kx 0’ x 2i ) + Y(x li + kx 0’ x 2i )] ’ <4,2) 

and l( x Li » x 2 i^ = YO^) is described b y (2.16). I fe is the uniform 
intensity of the background light. Ig is the intensity of the point 
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source, whose image is centered at (xq, 0) or (~Xq» 0) on the 
photosensitive surface under hypothesis as shown by Figure 3. 

The optimum statistic for binary detection of the point sources 
can be derived from the likelihood ratio in a way similar to (2.18), 
and we write 


5 R 


N 



(4.3) 


where the function H^(x_^) defined in (2.17) becomes 

I b [1 + xr Y o^ )] 

D 

“ 1 + D 2 [u 1 (x i ) “ ^(x^Hl + D 2 u 0 (x j .)] 1 (4.4) 

2 

, and u. (x. ) «=’ A Y, (x. ) , k * 0, 1, D = I /I A . Here A is an 

k i o k i d bo o 

arbitrarily defined finite area, as discussed in Chapter II. 

Since the weighting function lies between 0 and 1, 

0 < Y^(x) < 1 for k = 0 or 1. We will have 0 < H^^^) < 1 for some 
areas where Yq(x) > y^(x). This implies that the statistic is no 

longer a sum of non-negative random variables. As TW >> 1, the m.g.f. 
of the optimum statistic g^ according to (2.19) from Chapter II 


becomes 
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h Rk (s) “ gXP 


{aTI b yd 2 x[l+D 2 u t C s )][(H I |;x))- S -l]|, k=0,l , C4.5) 


where A is the area of the receptor, T Is the observation interval, and 

a is the coefficient defined in Chapter II. Since the average number 

N 

difference AN “ £ ^ n il n i0^ vanishes in this case, the statistic 
g R from (4.3) is the logarithm of the likelihood ratio defined by 
(2.11). It is easy to show h R1 ( s ) ® h Q (s-1). Also when Bayes 
strategy is used, the decision level is zero when the prior 
probability is £ « j as we can see from (2.13). 

In order to use the numerical method of steepest descent, the 
complex phase, defined by (A.7) in Appendix A when (4.5) is used, can. 
be written as 


^Rk (s) = % 0 ” laTI b^ d2 ~ [1 +d2 \^^(Hr^))~ S “ 1 ] +s » k= 0,l .(4.6) 


The saddle-point will be determined by solving the equation 4w(s)=0 

KK 


g R0 = H R (x), for k=0,l .(4.7) 

A 


The nth derivative of the complex phase for both and H Q as 


2 r , .~2 


,( ’Rk (s) = (- 1 )“aTI b Jd‘x[l+D i u k (x;KH R (x)) k - s in ” ^(x) > ( 4 . 8 ) 


for k=0, 1 



and ^^(s^) = f° r n ^ 1 where s R is the saddle-point 

under for k = 0,1 and satisfies (4.7) and (2.51). Because the 
function Ii^(x) has the term Yq ( x) .in the denominator as given in (4.4), 
the evaluation of (4.6), (4.7) or (4.8) will involve a double 
integration. The iteration procedure to search for the saddle point as 
discussed in Chapter II will be tedious numerically although it can be 

■ -t. 

carried out by the digital computer. In the meantime, the nth cumulant 
under hypothesis for k = 0 or 1 is 

C-l )n 4— in h^(s)l s=0 
ds 



H R (x),for k=0,l # (4.9) 

If all the nth cumulants for n > 2 are negligible in comparison with 

the second cumulant, the distribution of the statistic g can be 

K 

approximated by a Gaussian function. The mean and variance of the 
statistic can be calculated from (4.9) at n = 1 and n = 2. We have 
not carried out any numerical example for this ideal detector. However, 
we will investigate the performance of the ideal detector with a 
Gaussian image in the next paragraph for the decision whether a single 
source is located at x or at -x . 


‘(-l) n aTI 



D? ’ u k 3 


n 



Ideal detector for binary detection, in optical communication 

In order to transmit the information about the bit "1" or the 
bit "0", an optical system can be used by focusing a radiant source 
either oil the upper half plane or the lower half plane of a photo- 
sensitive surface* The observer will choose between the hypotheses: 

(H 1 ) the bit "l" and (H Q ) the bit "0" on the basis of the set of 
photoelectrons ejected from a large number of small areas of the 
photosensitive surface. The intensity of the image of the light source 
at point (x_) of the surface under hypothesis for k = 0 or 1 in the 
presence of the uniform background light can be described by 

W - VVA) ' (4 - 10) 

with Y k (x^) = Y(x li , x 2i + (-l) k x Q ), for k = 0 or 1. 

where y( x 1;L » x^) is wei £ htin S function and (0,x Q ) or (0,-x Q ) is 
the center location of the image on the upper half or the lower half of 
the photosensitive surface> and is the uniform intensity of the 
background light. 

The optimum statistic from the likelihood ratio can be 
expressed according to (A. 3) with 11^ (x) as the ratio of I^(x) and 
Ip(x) defined by (4.10). , g R will be compared with the decision level 
g RQ , and hypothesis will be chosen if g R > g RQ . Otherwise hypothesis 
Hq will be chosen. The m.g.f. of the statistic g R and its cumulants 
will be involved with double integration as discussed in the previous 
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paragraph. However, when the background light is negligible or absent 
so that I, = 0, the distribution of the statistic can be derived 
directly from the inverse Fourier .transform of the characteristic 
function when Gaussian images are postulated, and the performance of 
the ideal detector can be analyzed as -follows. 

The function H^(x) described in (4.4) at 1^ = 0 becomes 

.* I lfei^ I 0^i^ 

*■ , (A. 11) 

where for the Gaussian images we have 

= e X p{- [ Xl . 2 +(x 2 . + (-l) k Xo ) 2 ]), (4.12) 

ZT TO ZO 

for k = 0 or 1 

> 

and the function (x ^ ) from (4.11) becomes 

= exp(2 X() x 21 /a 2 ), x Q > 0 . (4.13) 

Thus the statistic will be the sum of the numbers of photoelectrons 
ejected from each small area dA^, i = 1, 2... or N, weighted by the 
coordinate x 2i of the center location (x^, x 2i ) of that area. As we 
pass the limit dA^ -*■(), the m.g.f. of the optimum statistic can be 



written as 


h Rk (s) = EX P 


exp 



- s 


0 ** 14 ) 


for k~0,l 

with N g = aTI^ as the total number of photoelectrons emitted from the 
photosensitive surface during the observation interval (_0,T) because of 
the light from the source. 

If we expand the right-hand side. of (4.14) into a series of 
x q o k 

exp[2(— ) (C“l) + s) s] we have 


N 


n 


V Cs) * exp(-N g ) exp 

n=0 


„ 2 21 
a s 

-M , s + 
nk 2 


(4.15) 


for k=0,l 


x n 2 

where - -2n<^) (-1) R 


for n-0,1,2. 


n 


Xq 2 

4n0 


k=0,l 


(4.16) 


The m.g.f. h^Cs) from (4.15) is a convergent power series where the 


nth term of the series is the m.g.f. of the Gaussian random variable 


with mean M nk and variance cr^ given by (4.16) and is weighted by a 
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factor exp(-Ng) N g n /r»J. Tlie p.d.f. of- the statistic g R can be 
obtained by talcing the inverse Fourier transform of its characteristic 
function and can be written as 

P k (g R ) 88 exp(-N g ) jfi(g R )+ 

exp ^“2 (% " M n fc> 2 Jj for k=0 * 1 (4.17) 

(3) 

where <5(g„) is the Dirac delta function 
R 

The p.d.f. of the statistic g R has the delta function at the 

origin. The probability that there is no pliotoelectron ejected at all 

is exp (-N ) . . . 

s „ 

The distribution of the statistic under hypothesis for 
k=0 or 1 is 



P r [g R S g R0. ] ~ 1 " 


\( g R ) dg R 


03 „ n 


i „ x V** K s r / 8 R0 " M nk\ f 

1 - exp(-y erfc I-— 1, for g R0 > 0 

n=l ' n ' 


n 


1 - exp (-N ) [ 1 erfc ( -^ ] , 

n=l x n * 


for gl>n <0 


(4.19) 


with a and M given by (4,16) and the function erfc(x) defined by 
n hk 

(2.44) in Chapter II. 
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The 


nth cumulant of the statistic can be also derived as 


follows , 


G„,- n *= - (“l) n — ~ h_ (s)|s=0 


n 


Rk 


ds 


n Rk 


(-i) 11 H (-j 2 ) f exp [- -^(x 2+ (-l) k x 0 ) 2 ] 

s 'o *'-«> /2rr.o 2 a 


0 ' .'-.CO /2tT . o 

/2x vn 

- »e(y) Vc> f0r k= °> 1 > 


k .2, n 

x 2 dx 2 


(4.20) 


where y 


nk 


v'TjT a 


exp[ ^ 2^2 + x q) ^ x 2 ^ x 2 


1_ 

2o' 


, x n n 

a 11 E[X - (-l) k -f] 


n 



- ° n ^f)(-D kr 0 r 


is the nth central moment ^ of a zero mean Gaussian random variable 
X with unit variance and 


E[X n ] = 1*2*3... (n-1) for n even 

l n \ _ n! 

n odd, \ */ r! (n-r) ! 


for 


The average error probability according to (2.6) is equal to the Bayes 
cost when the relative costs C^q - Cqq and are equal and 

C 01 “ C 10 “ ly C 00 = C 11 = °* The decision level g RQ will be set at 
zero when £ - “ , and the minimum error probability of the ideal 



detector can be calculated from 


P e ' -K + I a -V 

An" x 

= — exp(-N ) [1 + 2 ” erfcCyrT—)] CA.21) 

n= 1 11 ■ 

for n-1,2 , . . . 

At N » 0.1, 0,5, 1, 2 and 5, the error probabilities of the ideal 

detector have been calculated and plotted as a function of the ratio 
2x 0 

( ) by the solid curves in Fig. 19 where 2x A is the distance 

o 0 

separating the images, which are centered at (0,x^) and (0, -x^) of 

the surface, a is the width of the Gaussian image described in (4.12), 

The error probability P e will be reduced as the distance 2x^ between 

the two images increases. However, P £ will be limited by a value of 

“ exp(~N s ) as the ratio 2x^/o approaches to infinity. Therefore, the 

larger the value of N , the better the performance of the detector 

s 


will be. 



Figure 19 Average error probability P £ 
in binary bits detection vs. distance of 
separation (2 Xq/o) between images for 
ideal detector, counting comparator, or 
simple detectors in. the absence of back- 
ground light; N - 0.1, 1, 2, 5. 






Simple detector fox* detection of binary-bits 


When background light also passes through the aperture and 
corrupts the object light, a simple way is to measure the total number 
of photoelec trons ejected from, a finite area A* of the upper half 
photosensitive surface centered at (0 ? Xq). The simple detector 
described. in Chapter II thus can be used here to detect the binary bits. 
As TW >> 1, the number of photoelectrons ejected from the area A' has a 
Poisson distribution. The conditional probability under hypothesis 
for k = 0 or 1 is given by (2.3) where the means can be determined 
from 



"n = aT / f k , [I b + h y(x i> x 2 ~ x o ) 1 dx i dx 2 , 



n T0 = aT /^, ^ X b + I S y ^ X V X 2 + x 0 )] dx l dx 2 * (4.22) 


o o 

-For a Gaussian image, with y. (x) given in (4.12) and A* = irR n a is 

K- U 

the circular area of the upper half surface where x 2 + (* 2 - x Q ) 2 sC R^a 2 
We have 


n, 


T1 


“ T # /a' {Ib + Is eXPt " +(Xz " X 0 )2)])dX I dX 2 

2 2 h f* r° R 2 

arR^ o TI, + aT 7T f / exp(- y) RdRd8 

0 Jo z 


‘o ~ :*b • 2 it 


V I R o + d2(1 ■ e . x P(-Ro /2)) ] , 
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n T0 


aT / /[I v +I - 1 - -r exp[- --™-(x^(x 9 H-x n ) 2 ) ]]dx- dx~ 
•/Va’ b S 2na 2 2a 2 1 2 ° 12 

? 2 *q f* R 2 2Rx o 2x 0 2 

T V“ T 2f/ 0 J -Pt- f - cos °- -7- ] 


RdRdO 


V |r 0 + d 2 (i - Q( ^ , R 0 ))3 , 


(4.23) 


where Q (a , h>) i s the "Q function"^ for constants a and b , and 


N n = 2ira aT I, 
U b 


D - N 0 /aTl S = V N s ■ 


is the average number of photoelectrons ejected, from an area 2irc 

2 

during the interval (0,T), D is the signal-to-noise ratio as defined 
in (2.57) and (2.58). 

If A' is a square area, that is, we- only consider the area 

1 2 2 

A 1 = 4Rq 'a for |x^ - x^| £ R^a, jx^J £ R^a in the upper half plane, 
the means can be calculated as 


V 


n, 


T1 


aT 


dx. 




■w 


W 


dx. 


\h + hT~2 exp{ “ 7T tx i‘ +(x - 


2ira 


2a 


- V i >f 
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- V a2 “ TI b + aT 27 


2R, 


I C , A R 

S r 1 exp (- |-)dR 


2 ,2 


-R 


0 


N Q [ + D Z (1 - 2 erfc(R Q )) 2 ] , 


•V /- K o +R o° 


dx. I dxjl +1 


n ™ “ ° T lR 0 o^ 1 V 0 a'"' 2rb '; S 2xa 2 


exp[* 


2R 


-i- (x 2 + (x„+x 0 ) 2 ]| 

2a 

2 

0 . ^2 


2x, 


= N n l — + D (1 - 2 erfc(R n ))[erfc( — 

0 1 it 0 a 


V 


2x 


- erfc(— - + R 0 )3 } . 


(4.24) 


The observer will compare the number of photoelectrons n ejected from 

the area A* with a decision level g R ^ such that will be chosen if 

n > g nn and H n otherwise. The decision level can be determined 
RO U 

according to (2.33) and < 


'RO 


(£n A 


so 


+ n 


T1 


- n TQ ) [5,n(n T1 /n TO )3 


-1 


(4.25) 


where A s q is the decision level on the likelihood ratio and can be 
calculated according to (2.2) when the cost functions are given. For 
the Neyman Pearson strategy, randomization must be used, and g^Q is then 
an integer, which can be determined’ from (2. 74), where is preassigned. 

The reliability and the average error probability can be calculated by 
using (2.73) and (2,6). As the background light also passes through 
the aperture, there is a certain optimum size of the observation area 



at which for a fixed value of the false-alarm probability, the 
detection probability is maximum, as we have discussed' in Chapter II. 

The result and discussion would be the same if the observation 
area A* were taken from the lower half of the photosensitive surface 
and centered at (0,-x^), We only have to change the weighting function 
by 


Y k fe) 


= y(x 1 ,x 2 - (-1)' 


x Q ), for k= 0 , 1 


(4.26) 


where the mean numbers n ff Q and n^ observed from the lower half plane 
will be the same as given by (4.22), (4.23) or (4.24). 

In order to compare with the ideal detector just discussed, 
we shall investigate the performance of the simple detector in the 
absence of background light. The observation area A* will be now 
taken as the whole upper (or lower) half plane. For a Gaussian image 
with Tj^Cs.) given by (4.12) the mean numbers can be written as 


CO 00 




7 exp[- ~iy(x 2 +(x„ - x n ) 2 )] 
0 - 2 tto 2 o 1 ’ z 


V 1 - erf c( ~ )] , 


co oo 


ho 


= N / dx, / dx_ — 

S jL Vo 2 2 r 


12 2 
2 ex Pt 2 ^ X 1 + ( x 2 +x n^ ^ 

2ra 2a 


N erfc( — ) 
s a f 


(4,27) 



When the relative costs are equal and £ - j , the decision level is 
g RQ = (n n - n T0 )[£n(n T1 /n T0 )3“ 1 for the Bayes strategy. The error 
probability P is calculated according to (2.77) and plotted as a 

function of the ratio (2x n /a) by the dashed curves in Fig. 19 at 

' ' 2x 0 
N g = 0.1, 0.5, 1, 2, and 5. As the ratio (-^— ) approaches infinity, 

where n N and n n ->- 0, the error probability P will be also 

limited by — exp(~N ) , which is half of the probability for n=0 with 

4 S 

the average number N g . 

The error probability P of the simple detector in the absence 

of the background light can be further reduced if the optimum 

observation area is considered. That is, instead of taking the whole 

upper half plane, the observation area will be moved up from the center 

line by a distance x^ such that the error probability according to 

(2.77) is minimum. In other words, for the Gaussian image we can take 

the proper integral range of x 0 from (4.27) as (x ,<») instead of (0,«>) 

a 

as x > 0 so that the probability P can be. further minimized. For 
d e 

N g = 5, the typical values of x^ at different values of ratio (x Q /a) 

are listed in table 2. The minimized P is plotted as a function of 

e r 

(2Xq/<j) in Fig. 20 where the error probabilities of the ideal detector 
and the simple detector at x^«0 are also plotted. 

When the background light also passes through the aperture, 
the observation area A* for Gaussian image discussed in (4.23) or 
(4.24) can be used. With the square area A = 4a at Rq= 1, the means 
are calculated from (4.24), and the error probabilities of the simple 
detector are plotted as a function of N g for - 0, 0.1, 0.5, and 



1 at x^/a = 2 in Fig. 21 by the dashed curves. In Fig. 22, the error 
probabilities are also plotted as a function of at x^/c = 1 and 
Xq/ct = 2 for the square area A* = by the dashed curves. When the 
mean number n^.^ is fixed at 0.5, 1 or 2, the error probability of the 
simple detector is then plotted as a function of the number difference 
AN - - n^Q in Fig. 23 by the dashed curves. 



Figure 20 Average error probability P in 

binary bits detection vs., distance (2 x^/cr) 

for ideal detection and simple detector with 

fixed area or optimum area in the absence of 

the background light, N =5. 

s 



\ Simple Detector 
, (fixed area) 

\ \ Simple Detector 
\ (Optimum oreo) 


Ideal Detector 




Figure 21 Average error probability P 
in binary detection vs. average number 
for simple detector, counting comparator 
with square observation area A* = 4 at 
Rq ~ 1 and x Q /o =2, N Q = 0, 0.1, 0.5, 1. 


<T> 





Figure 22 Average error probability in 

binary detection vs. average number N for 
. s 

simple detector and counting comparator with 

i 2 

square observation area A ~ Ao at = 1 
and Nq = 0.1; x^/cr ~ 1 » 2. 
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Figure 23 Average error pi*obability 
in binary detection vs. average number 
difference AN for simple detector and 
counting comparator where AN ~ - n TQ 

with n,^ as the mean number of photo- 
electrons observed from a finite area of 
one-half of the surface under hypothesis 
for k=0, 1; u tJ , Q « 0.5, 1, 2. 








Cou nting comparator for detection of binary bits 

Another simple receiver for detecting the radiant source located 
at either (0,x Q ) or (0,-x Q ) s similar to the receiver discussed by 
Peters and Arguello (2) , measures the total number n u of photoelectrons 
emitted from the area A* centered at (0,x Q ) in the upper half photo- 
sensitive surface and compares it with the total number n^ of 
photoelectrons emitted from the area A centered at (0 ,-Xq) of the 
lower half plane (a'=a"). The observer will choose H 1 if n u > n L and 

H if n < n . When n = n T hypothesis H will be chosen with a 
0 u L u L 1 

probability f. The false-alarm probability and the detection 
probability according to (2.9) and (2.10) can be written as 


Q 0 - Fr[n u > n L |H 0 ]+£, P K [n u = n L |H 0 ] 


03 DO 




I’ (n+m) P.. (n)+f p 0 <n) P,(n) , 

n=0 m=l h=0 



Q d - Pr[n u > +'£ Pr[n u - 


03 03 




P 1 (n+m) P Q (n)+f Pj_(n) P Q^ n ^ > 

n=0 m=l n=0 



where P^Cn) = e ^^/n! * ^ or > 


(4.28) 


with means and given in (4.22), (4.23), or (4.24). 
(4.28) can be expressed in simpler form. Since 
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Pq (n + in) P^n) 


n-0 m=l 


_ n+m 
n T0 6 


" n T0 n T " e n Tl / n! (nrfn) ! 


- ( n T1 + ^ 0 ) 


r“ \ m 

n T0 \ 2 


m=l \n. 


X m (ab) 


- 1 - Q (M) 


where I (x) - 

n-0 


(x/2) m+2n / n! (n+m)! 


is the modified Bessel function 1, ; for ra=0,l,2... and 


Q(a,b) = 1 - e 


- }(a 2 + b 2 ) 


► /bV 

=rtd I I 
=1 U/ 


I (ab) 

m 


is the "Q function"^ with 


a « V 2 n T1 , b = V2n TQ 


Also we can write 


p i(n) P Q (n) Z-/ ( n T 0 n Tl 

n=0 n=0 


("to " T l> n e " <nT0 + nTl) /(n0 2 


1 , 2 . ,2. 

- J (a + b ) 

c I n (ab) , 



error 


(4.32) 

The error probability from (4.32) of the counting comparator has 

been evaluated in the absence of the background light for the Gaussian 

images with the mean numbers under and as given by (4,27). The 
* « 

error probabilities P £ of this receiver are plotted as a function of 

the ratio (2x n /c) at N = 0.1, 0.5, 1, 2, and 5 in Fig. 19. The error 
u s 

probability decreases as the ratio (2x Q /c) increases. As 2 xq/c 
approaches infinity, the error probability will be limited by 

I.e*P(-N 8 ). 

When the background light also passes through the aperture, the 

error probabilities of this receiver are also calculated with a square 

observation area A* = 4c^ with R = 1 and plotted as a function of N 

0 s 

in Fig. 21 by the solid curves at = 0, 0.1, 0.5, and 1 for x^/c - 2. 

In Fig. 22, the error probabilities are plotted as a function of N 

s 

when the same observation area A = 4c is used and the average 
background dumber is fixed at 0.1 for x^/c - 1 and 2. In Fig. 23 
the error probabilities are plotted as a function of the number 


Therefore, for equal prior probabilities or £ = the average 
probability for making the decisions is 


\ % + 1 (1 




00 CO 


P 0 (n4m) P (n) + f P Q (n) P^n) 

n=0 m=l n=Q 


= 1 - Q(a,b) + exp (- |( a 2 + b 2 )) I Q ( ab ) . 



difference AN by the solid curves at fixed values of average number 
n 0.5, 1, and 2. As the size of the square area A* is varied, the 

iU 

error probability changes. The error probabilities of the counting 
comparator are plotted as a function of the length in Fig. 24 at 
different values of N^, 0.1, 0.5, and 1. 



Figure 24 Error probability of. the counting 

comparator vs. the length R Q for the rectangular 

observation area; N =5.' The curves are indexed. 

s 

by the parameter defined in (2.57). 





linage location effects on the detectors in the absence of, j ack-ground 

light 

The Investigations of the detectors of binary bits that we have 
just discussed are based on the primary assumption .that we know that 
the image Is located at either (0,x 0 ) or (0,-x Q ) exactly. We would 
like to investigate the performances of those detectors when the 
distance between the image is still 2 Xq, but the images are mislocated 
with a shift of ±e. That is the images are actually located at (0,x 0 ') 
and (0»- x o') under hypothesis and where x Q = + e, - d 

or x 0 * - x Q - e, x 0 " = x Q + £ for e > 0. 

When the ideal detector is used, the expression of the function 
1 ^( 3 ^) from (4.13) can still be used, where the weighting function 

Y k^i> f *° m wil1 be re P* aced with x o = X o' for k = 1 and 

x Q = x Q n for k=0. We can derive the m.g.f. of the optimum statistic g R 
following (4.14), and the expression of the m.g.f. h Rk (s) from (4.15) 
can still be used with M , now replaced by 


M 


nk 


X 0 X 0 k 

- 2n (-1) » for k=1 


X X 

- 2n (“l) k , for k=0, n=0,l,2 


(4.33) 


and o n unchanged . 

The false-alarm probability and detection probability with zero 
decision level can be expressed according to (4.19) as 
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N 

exp (-H ) n , 

n-1 


n x " 

~~ erfc (yxT — ^ — ) 


o f 


Q d - VB^olV 


*» exp (-N ) 
s 


n=l 


N 

s 

n! 


n x • 
f~ (1 - erfc (i^n ”™ )) 


(4.34) 


The average error probability for equal prior probability or 


K ~ ~2 can b e written as 


P e ■" - V 


E N n x " x ' 

~f— (erfc (/n ~~) + erfc (/n ) ] 

•5 -11. V .O • 


n-1 


(4.35) 


The error probability will be the same for either x^’ = x^ + e 

X o" “ x o “ G or x q* = ” e > x o" “ x 0 + G * ^ at * s j (4.35) is the 
average error probability of the ideal detector when the images are 
mislocated with a shift either e or -e. 

When the counting comparator is used, with the distance between 
the images fixed at 2x^, the mean numbers due to a shift e or -e of the 
actual image can be expressed as 
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/ x 0 * G \ 

n Tl± * N s fl “ erfc ( — / 1 * 


K-) , 


n T0+ ” N s erfG 


(4.36) 


where n^ + , are means for a positive shift e and n T ^„» 

are the. means for a negative shift. -e. The false-alarm probability 
and the detection probability of the counting comparator when the 
images are misolcated at (0,Xq) and (0,-Xq) under hypothesis and Hq, 
while actually located at (0,Xq+c) and (0,-Xq+e) or (0,x^-e) and 
(Oj-Xq-c), can be written as 


Q 0± - Pr[n > njH 0 ] + £ Pr[» u = njty 


L ^ p 1? (n) P Q+ (n+m) + f p 1? (n) P Q±^ » 

n-0 m^l “ n-0 


Q di = Pr[„ u > nj^] + f Prln, - njH 1 


JL.4 JL*§ p 1+ (n) p 0 -( a +m) + f p 1± (n) P o+^ n) , 

n=0 m=0 “ n=0 


(4.37) 


where p k+ (n) = e /n! , for k=0,l . 


and are the false-alarm probability and the detection 
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probability for a positive shift e while Q q _, Q d _ are the probabilities 
for a negative shift -e* 

For equal prior probability ^ ~ \ anc ^ ^ ~ ~ 2 7 avera 6 e error 

probability will be the same for either a shift e or -e and can be 
written as 



where a + - , b + -/ 2 n TOt and ( K a > b >> I o (ab) are given ln (4-29) ' 

P is the error probability for a shift of e and P is the error 

probability for a shift of -e. P e+ = P e- as we can see from (4.38). 

The error probabilities of both the ideal detector and the 

counting comparator are calculated at different values of shift e when 

(4.35) and (4.38) are used. In Fig. 25, the error probabilities of 

these two detectors are plotted as a function of the ratio |g|/xq at 

N g “ 2, 5 and 8 with x^/a - 1 . In Fig. 26 the error probabilities are 

plotted as a function of the ratio |e|/xq at x^/cr = 1.5, 2 and 4 with 

fixed value of N at 5. 

s 



Figure 25 Average error probability ? e 
in binary bits detection vs. the ratio 
|e|/xQ due to the mislocation of the 
images for the ideal detector or the 
counting comparator at x Q /o = 1; = 2, 

5, 8. 
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s 


Figure. 26 Average error probability P in 

V- 

binary bits detection vs. the ratio | e | /x^ 
due to the mislocation of the images for the 
ideal detector and the counting comparator 
for fixed value N g = 5; x^/a = 1.5, 2, 4. 
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Discussion 


In the absence of uniform background light, registering the 
locations of the ejected photoelectrons for the detection of either 
one of the two Gaussian images, as we have discussed in this chapter, 
not only utilizes more of the. information, but also effectively reduces 
the mean value of the optimum statistic under Hq and thus improves the 
performance of the ideal detector. As for the simple detector, the 
gain by optimizing the observation area will be limited because 
varying the area will affect both the mean numbers and as 

defined in (4.27). Therefore, the performance of the ideal detector 
is better than that of the simple detector, as we can see from Fig. 19 
and Fig. 20. When the counting comparator is used, the observed 
numbers of the photoelectrons ejected from both the upper half surface 
and the lower half surface will be used, while for the simple detector 
only the number of photoelectrons ejected from the upper half surface 
,(or the lower half surface) will be utilized for making decision. In 
other words, the counting comparator utilizes more information than 
the simple detector, but less information than the ideal detector. 

Thus the performance is better than that of the simple detector, but 
not as good as that of the ideal detector as shown in Fig. 19. 

When background light also passes through the aperture, the 
average error probability for making decisions will be increased. In 
the meantime, the observer must define a finite observation area 
when any one of the detectors will be used. For the simple detector or 
the counting comparator, the finite observation area can be chosen such 


177 



that for a fixed false-alarm probability, the detection probability of 

the detector will be as large as possible. The proper size of the 

square area for the counting comparator to be used at 0.1 £ £ 1 is 

R q - 2 or A* = 16a as we can see from Fig. 24. For a finite square 

2 

area 4a , the average error probabilities for both simple detector and 
counting comparator will increase as the background light increases or 
the ratio /a decreases as shown in Fig. 21, Fig. 22, and Fig. 23. 

When Gaussian images are postulated, the performance of the 
detectors for detecting binary bits will also depend on the prior 
knowledge about the locations of the images. The average probability 
of making errors in each decision will be increased if the images for 
bit M l n and bit "0" are mislocated. The severity of mislocation will 
also depend on the distance 2x^ separating the images. For example, 
when the ratio x Q /a ~ 4, the effect on either the ideal detector or 
the counting comparator due to mislocation will be insignificant until 
the ratio of shift |e|/x q is greater than 1.2, as we can see from. Fig. 
26. Furthermore, the effect on the counting comparator of mislocation 
of the images will be little smaller than that on the ideal detector 
as shown in Fig. 25 or Fig. 26 and at Xq/o - 1-5; for example, the 
average error probability of the counting comparator becomes smaller 
than that of the ideal detector when jej/x^ is greater than 1.6, as 
shown in Figure 26/ 
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Table 2 ~ Optimum Distance for Simple Detector 


X 

O 

' Q 

0.1 

0.5 

0.75 

1.0 

1.25 

1.5 

2.0 

3.0 

V° 

0.16 

0.52 

1.09 

1.1 

1.1 

1.08 

1.04 

0.84 


c 



Footnotes 




Chapter 

1 . 

'Helstrom [27] , 



2. Peters and Arguello [23] # 

3. Papoulis [29], p. 155, eq. (5.77) . 

4. Papoulis [2.9], p. 146 . 

5. Helstrora [10], Appendix F. 

6. I. S. Gradshteyn and I. M. Ryzhik [30], p. 961, eq. (8.445) 
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APPENDIX A 


Asymptotic Expansion Approximation of the 
Tail Distribution by the Method of Steepest Descent 

Xn detection theory, the evaluation of the cumulative distri- 
bution for a statistic is often important, and if is required to reach 
a certain, accuracy . For example, when the Neyman-Pearson criterion is 
used, the decision level of the likelihood-ratio detector must be 
calculated from the preassigned false-alarm .probability Qq. As Qq is 
small, there may be a serious error in the decision level if the 
cumulative distribution cannot be accurately calculated. The 
statistic we will discuss in this Appendix is of the" type of the sum 
of N independent random variables and can be written in general as 


N 

E 

i - 1 


i - 1, 2, 


N . 


(Al) 


When {X^} are identical random variables and N is large, the cen- 
tral limit theorem can a ppH e( j. if {X^} are independent 

but not identical, the distribution of Z may be dominated by one of 
the random variables and can be approximated by the Gaussian distri- 
bution when conditions such as Lyapunov’s condition are satisfied. 
On the other hand, the distribution of the statistic Z can be derived 
from the inverse Fourier transform of its characteristic function when 
the inverse integral can be approximated by the asymptotic expansion 
series through the steepest-descent method. For detailed treatment of 
the steepest-descent approximation we- refer to the works of many 


18 A 
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authors For applications of the steepest-descent method in 
statistics we refer to the works of the authors^ Here we will 

discuss the asymptotic expansion approximation of the integral with 
only one saddle point. 


The characteristic function' 1 ' of the random variable Z is 


defined by 


h(io) = E[e ltllZ ] 


e lwZ dF(Z) , 


where F(Zq) = Pr[Z ^ z q 3 is the distribution of the random variable Z, 
The probability that Z exceeds a value Z^ can be written as 


Q(Z ft ) - Pr[Z > ZJ - 1 - ‘ # U (Z - Z) dF(Z) , 


where U (x) is the step function. 

If we take the Fourier transform of Q( Z q) by discarding the 
portion of the integral which oscillates with infinite rapidity, we 
have 


^r[Q(z 0 >] = J 


« -IujZ 

e U Q(Z q ) dZ 


“ 2tt6(w) - h(-w) , 


where 6 (to) is the Dirac delta function. 

The probability Q(Zq) tan then be expressed in terms of h(— w) 
by taking the inverse Fourier transform of (A4). That is 
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Q(Z 0 ) V[Q(Z 0 ) ] 


-iwZ 


0 h (—w) 


where gF, ^are the Fourier transform and inverse Fourier transform 
operators respectively. . 

When the integral on the right-hand side of (A5) cannot be 
evaluated exactly, the method of steepest-descent can be used to 
approximate the tail probability Q(Zq). The integral from (A5) can be 
extended to a contour integral as 


Q( V " 1 ~ 2 s exp[Z 0 -* (s)3 ds * 


The contour of the integration is a straight line running from 
a Q - i°° to a G + i»> for a 0 > 0 in the complex plane s with real values of 
Zq. Here 


$(s) = s + An h(s) 


is the complex phase of the integral, and h(s) = E[e ] is the m*g.f* 

of the random variable Z as defined by (A2) with s = iw- The inte- 

gral in (A6) can be approximated by taking the integration path on the 
complex plane so that the imaginary part of <J>(s) is constant (path of 
the steepest descent) along the path, which also passes through the 

V (s > 

real saddle point upon which the modulus of e u decreases most 
rapidly. The saddle point s^ can be determined from the equation 
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4— d>(s) “ 0 or 
as 

Zq ra - h T (s Q )/h(s 0 ) . ‘ 

Since the origin is also a singular point of the integral, the 
contributions from taking the integral path through the saddle point 
s Q and around the singularity s - 0 cannot be treated separately when 
Sq is small or near to the origin. Rice^ presented a more general 
discussion for cases involving more than one saddle, point. In our case 
the complex phase <j>(s) has only one saddle point as discussed in 
Appendix R. If is analytic in the neighborhood of the origin, 

4( s ) behaves much like a second-order polynomial in s. This suggests 
changing the variable of the integral from s to" u in such a way that 

u) “ u 2 - 2u q u ~ <j>(s) (A9) 

is the new complex phase ^ ^ in the u plane with Uq as the new saddle 
point corresponding to Sq in the s-plane, and «p(uq) - ^( s q)- The new 
saddle point u Q can be determined from (A9) as 

Uq = sign(s 0 ) (-^(Sq))^ } (A10) 

/ * 

where the function sign(x) ~ 1 for x > 0, sign(x) ~ ~ 1 for x < 0, 
so that Uq and Sq have the same sign. <K s q) is the complex phase at 
s *■ Sq and is assumed to be non-positive. Thus, integral on the 
right-hand side of (A6) can be expanded into a uniform asymptotic 



-1 


series of powers Z^ as 


exp [Zq (s) ] da 


~ f (u) exp[Zg(u 2 - 2 Uq u)]du 


-n 


= ^ 0 (Z Q ) + ZLt A n Z o , 

n - 1 


(All) 


where 


f(u) = uds/sdu } 


k 0 ( z 0 ) = / U ~ 1 exp[z Q (u 2 - 2 u u )]du } 


k x (Z 0 ) “ i exp[Z Q (u - 2 u Q u)]du , 


With c* running from a' - i 00 to a f '+ ±°° for a 1 > 0 in the complex u 

plane. A n is the coefficient of the nth term in the . expansion and can 

(13) 

be determined from the Ursell method as discussed by Rice 

Thus, the probability from (A6) can be approximated by the 
uniform asymptotic expansion series as 


Q(Z 0 ) = 1 - B(Z 0 ) - I(Z 0 ) 


(A12) 


where 



E(Z 0 ) 


'l - erf c [(-2 Z Q ♦ (Z^)’*] , for S() < 0 
erf c [ (-2 Z Q <KZ 0 ))* S ] , for s Q > 0 


-and 


Kz 0 ) 


exp[Z Q 4> (s Q ) ] 

(21 


t2n z 0 * v ^(s 0 ) y* k* 

2k n 

-2k+n 



-2 


-1 


z n 4 (2) (*o) '"° 


n = 0 


(-s 0 ) 



- 0 


il » n ( 2 ) J2,H-k 


0 V2 ' k \ 2 + (s 0 ) 


h 


(Z 0 *(e 0 » 


-k 


where 


and 


erfc y * f exp(-f3 2 /2)d3 , 

/ST J 
y 


£,n 


0, for n<£or£=0,n.>l 

1, for ri = £ = 0 


n -1+1 (m+n). x 

-2 -sr*^ m <j) (s 0 ) 


£+1, n+l - /_.n\ti(2)/o \ £,n-m-KL 

n + 1 tn = 1 . (m+2)!4 ( s q) 

(i) =i(f + 1 ) (l +2 ) • • • (I + ”- 1 ). (iL = 1 


A, are the coefficients which can be calculated by the recurrence 
x,n J 

relation through the nth derivative ^(s^) of the complex phase, where 
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4> n (s) « * (^s) k( s )» ^ or n — ^ , (A13) 

and is evaluated at the saddle point s 15 s^. This schema can be easily 
programmed for a digital computer * The tail distribution Q(Zq) is 
obtained by adding up the terras in the asymptotic- expansion given by 
(A 12 ) until they become insignificantly small or they stop decreasing 
and begin to Increase. 

When Z is a discrete random variable, the tail distribution 
can be written as 


Q(z 0 ) - Lj p(Z) 
0 z > z 0 


(A14) 


where p(Z) is the probability of the random variable Z and the m.g.f. 
can be written as 


h d (s) = E[e~ sZ ] = 



-sZ 

e 


(A15) 


The calculation of the tail distribution is simpler and more accurate 

if one first approximates the probability p(Z) and then adds up the 

probabilities for all Z > Zq as given by (A 14). For example, when Z 

takes only integral values \ we can express the probability by the 
(14) 

contour integral and approximate it by the uniform asymptotic 


series as 



1 

2ni 


e 



p(Z) 


Z4> d (s) 


ds 


exp [Z s q) 1 
[2t Z <j» d (2> ( s ) ] h 


CO 






(A16) 


where c , is 
d 


the contour running from a" 


in to a" + in, a" > 0 and 


+d (s) 



£n h , (s) + s » 
a 


(A17) 


The saddle point s Q , the coefficients, A £ ^ 2m and the derivative of the 
complex phase ^(s) can be determined as before. 

To illustrate the applications of the asymptotic-expansion 
approximation, three well-known distributions, Gaussian, gamma, and 
Poisson will be discussed as follows: 


Gaussian distribution 

The p.d.f. of Gaussian distribution is given by 

p(Z) = d— exp [- A- (z - m) 2 ] (A18) 

»^2no 2o 

with m, o as the mean and variance. The m.g.f. h(s) is given by 

a 2 2 

h(s) «= exp [- s m “5- 


2 


3 


(A19) 



The saddle point , can be determined from ® 0 or by (A8) * We 

have 


Sg *= (m - Z q)/o^ « (A20) 

The complex phase . and its nth derivatives evaluated at are 
<Ks 0 ).- Z 0 _1 In h(s 0 ) + s Q = Z 0 _1 (m - Z Q ) 2 /2 B 2 ( 


♦"(■o 5 = ° 2/z o , 

♦ '(•o> " ^ TI (Sq) = 0 } for n _> 2 . (A21) 


Thus, the coefficient ^ n - q ^ or n ^ ® and I(g^) vanishes; 

also, the tail distribution becomes 


Q(Z 0 ) - 1 - E(Z Q ) - erf c 


m - Z, 


> s 0 < 0 


1 - erfc 


m - Z. 


s Q > 0 (A22) 


as it should be for Gaussian distribution. 


Gamma distribution 


The p.d, f . of a gamma distribution is given by 
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p(Z) - ~ (XZ) 6-1 e Xz , (S > 0, X > 0, Z > 0 


otherwise , 


(A23) 


The is given by 


h(s) 


-M' 


(A24) 


The saddle point is determined from (A8) or 


s 0 " z o 3 - x > V* ° • 


(A25) 


The, complex phase and its nth derivative evaluated at s n are 


• ■ $(s 0 ) - z 0 -1 [-e to(e/Z Q X) + 3 - X z 0 3 ( 

♦“(.„> - (-l) n (n-1) I (Z 0 /S) n- ^ for n > 2 . (A26) 

The special case at A = 1, M 1 is the exponential distribution where 


p (Z) « e~ Z , Z > 0 

“ 0 otherwise . (A27) 

The complex phase and its derivatives of the exponential distribution 
evaluated at the saddle point from (A25) and (A26) are 
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1 

f 

1)! for^ n >. 2. (A28) 

The numerical calculation of the tail distribution by using the 
asymptotic expansion series given in (A12) for the exponential distri- 
bution has been carried out and compared with the exact value. The 
percentage of relative error is listed in Table 3. . 

Poisson distribution 

■ p (Z) = e~ X A Z /Z! , Z « 0, 1, . . 

*= 0 otherwise . (A29) 

The m.g.f. is 

h d (s) = exp{ A (e S ~ 1) ] . 

The saddle point is solved from (A8) or 

s Q - in (A/Z ) . (A 31) 

The complex phase and its nth derivatives evaluated at 


(A30) 


<Ks 0 > = z 0 -1 a + ^ z Q ) - 

As q ) = (-I)" Z 0 n_1 (n - 


are 
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d»(s n ) = 1 - A Z ~ 1 + in (A Z ~ 1 ) 

T 0 i 

* n (s 0 ) - (-1)", for n >2 . (A32) 

The tail distribution has been evaluated by first approximating the 
probability p(Z) with the asymptotic expansion series given in (A16) 
and then adding up the probabilities for all Z > Z^. The relative 
error by comparing with the exact value is listed in Table 4. 
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Appendix B 

Iteration Procedure for Finding the 
Saddle Point in the Asymptotic Expansion Series 

In this Appendix we will show that there is only one real 
saddle point of the contour integral given by (A6) in Appendxx A for 
approximating the tail distribution of the optimum statistic g or the 
■threshold statistic g^ from (2.18) or (2.26). The formulas used for 
searching for the saddle point when the Neyman -Pearson criterion ' is 
applied will be given. 

The cumulant-generating function for the ideal detector is 
given by (2.20) from Chapter XI as 

c^Cs) = Jtn h k (s) * “ T I b j [ & (x)[(H(x)) -l]d x ^ 

A 

k - 0, 1 . 


The m.g.f. M(s, g Q ) of the positive statistic g is defined 


-s (g~S 0 ) 

M(s,g Q ) * E[e ] 

r - a( 8-*o ) 


dF k (g) 


= exp[g Q s + c^s)] 

" exp[g Q <J> k (s)3 , k - 0, 1 


by 


(B2) 


and can be expressed as a function of 4> k ( s ) as given (2.48). 
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As we investigate tlie complex phase ^(s) f° r k - 0, 1 and its 
first and second derivatives from (2.53) 

$ k (s) gQ^ a T \ ^ [H(x)] k ~ S £n H(x) d 2 x + 1 , 

• - A 

4> lc Cs) - g 0 _1 a T I h J [H(x)] K S in 2 II (x) d 2 x , 

A 

k « 0, 1 , 

i i it 

we have ^C” 00 ) = - », <|>^( co ) = 1 and $ (s) > 0 for all real values of 

t 

s. Then $ (s) is a non-decreasing function of real s. Furthermore, 
the cumulant-generating function C^.(s) exists in -» < s < 

Therefore,, according to the theorems 6.1 and 6.2 of Daniels 

t 

there is only one real root, which is simple, of ^(s) = 0 for 

gg > 0 and no more real roots otherwise. This is true also for the 

threshold statistic g. . 

u 

When the Neyraan —Pearson strategy is used, the false-alarra 
probability Q q is preassigned. To search for the saddle point s^ under 
H q> we must arbitrarily pick the initial values s^ and s Qp in such a 
way that the false-alarm probability calculated at s^ N or by using 
the asymptotic series from (A 12) will be or and 

q on * Q 0 * Q 0P • 
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A trial value sjj of the saddle point then will be determined from the 


, 8 0P “ 6 ON ( n , 
S 0H Q 0p - Q qn (Q 0 - <W • 


X T 

The trial value of the decision level g Q and probability Qq can be 

calculated by substituting into (2.62) and (2.55) for the ideal 

detector or (2.69) and (2.55) for the threshold detector. If 

Qq > Q q , the values of s op and Q Qp will be replaced by the values 

Sq T and Q 0 T . If Q q T < Q q , the values of s QN and Q Qn will be replaced 

X 

instead. Thus, a new trial value Sq will be obtained according to 

(B3), and the procedure ' will be repeated until the calculated values 
T 

Qq approaches the preassigned value Qq within a tolerable error. 

T 

The decision level g Q is obtained from the final trial value g Q . For 

the ideal detector, the saddle point s p under can be determined 

from (2,51). For the threshold detector, the saddle point s p under 

must, be searched for again by iteration. That is, after the decision 

level g is obtained, we must arbitrarily pick the initial values s 

yo 

and s p p in such a way that the decision levels calculated from (2.69) 

T 

are g N and g p and g N < g G0 < g p . The trial value of s 1 will be 
determined from 


, S 1P ~ S 1N , . 

+ ~ - - ^00"* g N * 


g P " g N 


By substituting the calculated value -s p into (2.69), the trial value 

T T T 

g Q0 is determined. If > g QOj g p and s ip will be replaced by 



m 

and , otherwise g^ and s^ will be replaced instead. A new trial 
value s x T will be determined again from (B4) , Repeat the procedure 
until the calculated value gg^ approaches the decision level ggg within 
a tolerable error. The final trial value will be used to calculate 
the detection probability for the threshold detector by using 
(2.55). 

For calculating the integral, Simpson’s rule has been used 
for the numerical integrations. The number of points used in the 
integration procedure depends on the relative error the system 
requires. In our calculations we use about 50 points at most times. 
More points can be used when higher accuracy is required. Further- 
more, when the saddle point (k = 0, or 1) is near zero, double 

precision may be required to calculate the saddle point so the 

term I^(gg) in (2.55) can be evaluated accurately. 

The typical behavior of the saddle point s^ of the ideal 

detector for a Gaussian image is given in Fig. 27 where the values of 

2 

6q are plotted as a function of signal-to-noise ratio D at = 0.5 

-3 -5 

and 5, with preassigned false-alarm probabilities Qq = 10 and 10 



Figure 27 Saddle point s Q vs. the signal 
2 u 

to-noise ratio D for a Gaussian image; 


N, 


0.5, 5, Q q 
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